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Preface 


Two subjects are treated in this book. The main subject is the theory of Bernoulli 
numbers, which are a series of rational numbers that appear in various contexts of 
mathematics, and the other subject is the related theory of zeta functions, which are 
very important in number theory. We hope that these are enjoyable subjects both for 
amateur mathematics lovers and for professional researchers. There are easy parts 
as well as difficult parts in this book, but we believe that, according to the taste of 
the readers, they can enjoy at least some parts of this attractive mathematics. Since 
the logical relations between the chapters are rather loose and not a straight course 
from the beginning to the end, it would still be worthwhile for readers who pick 
only some chapters which fit their tastes and background knowledge. As far as we 
know, books whose main subject is Bernoulli numbers are rare. Some parts of this 
book consist of rather standard number theory, but our expositions on these subjects 
are not always so standard, and some parts are completely new and have not been 
written in any reference before. 

Now, there are many numbers in the world which have names, but what is a 
Bernoulli number? In high-school we learn the formula for the sum of the integers 
from 1 to 77 . Or maybe the formula of the sum of squares l 2 + 2 2 + • • • + 77 2 is 
also written in the standard textbooks for high schools. But if we make the powers 
bigger, for example, if we take 


l 6 T 2 6 T T 77 6 , 

then what kind of formula can we have? It seems that many people hit upon this kind 
of question and it often happens that ambitious high-school students try and find the 
solution. In fact, the formula was already known for any power at the beginning 
of the eighteenth century. (See Chap. 1, p. 2.) Bernoulli introduced some special 
numbers to express this formula. These are the Bernoulli numbers. (For a more 
precise history, see Chap. 1 .) In this book, this formula of the sum of powers appears 
in various places. In fact, in this book, four alternative proofs of this formula will be 
given in different places (cf. Sects. 1.2, 4.3, 5.2, 8.3). 


VI 
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Well, is it so interesting to give this formula of the sum of powers? Indeed it is. 
But if the meaning of Bernoulli numbers were only this, we would not have written 
this book. The real meaning of Bernoulli numbers lies in a different place. Let’s 
look at the following formulas: 

11111 _ jr 2 

l+ 2 2 + 3 2 + 4 2 + 5 2 + 6 2+ '"~~6’ 

11111 _ it 4 

l + 2 4 + 3 4 + 4 i + ¥ + 6 4+ "'~ 90’ 

111111 7 T 

1 — — — — -(- — ■ • ■ — — . 

3 5 7 9 11 13 4 

(You can find the proofs in Corollary 4.12, p. 61, and Theorem 9.6, p. 148.) At 
first glance, these kinds of formulas could give you a mysterious impression, since 
a regular sum of simple rational numbers suddenly changes into a transcendental 
number like jt. The left-hand sides of these formulas are values at some special 
points of so-called zeta functions, or L -functions, which are another important 
subject of the book. An example of zeta functions is given by 


11111 

£(s) — 1 + — — F — — V - — V - — V — — b 
2 s 3 s 4 s 5 s 6 s 


Here the sum is regarded as a function with respect to the complex variable ,v. This 
function is called the Riemann zeta function. The left-hand sides of the first two 
formulas above are the values of £(s ) at s = 2 and s = 4, namely £ (2) and £ (4). The 
right-hand sides are related to Bernoulli numbers and generalized Bernoulli numbers 
(see below). The true reason that Bernoulli numbers are indispensable for number 
theory lies in this type of relation with special values (values at special points) of 
zeta functions. A huge number of functions called zeta functions appear in number 
theory, and they always contain very important information. We can even say that 
there appears a different zeta function for each important number theoretical object. 
The zeta functions which are related to Bernoulli numbers are just a part of them. 
But still they have something to do with many number theoretical phenomena. For 
example, they are connected with a kind of volume in non-Euclidean geometry, with 
dimension formulas of spaces of modular forms, and with the number of equivalence 
classes of quadratic forms (class number formula), and they appear as constant terms 
of Eisenstein series, which are important in the theory of automorphic forms. Also 
they are related to the number theory of cyclotomic fields, that is, the systems of 
numbers obtained by adjoining the roots of unity to the rational numbers. If we 
wanted to explain all of this, we would need several more books, so it is not possible 
to do it here, but we have tried to give some hints about those relations. 

Now, for the reader’s convenience, we sketch the content of each chapter. 
Comments below written in parentheses will not be explained in this book in detail. 
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In Chap. 1, after explaining the history of how Bernoulli numbers were intro- 
duced and giving the definition of Bernoulli numbers using recurrence relations, 
we prove the formula for the sum of powers of consecutive natural numbers. 
In Chap. 2, we give a formula to write down Bernoulli numbers in a simple 
way. There we use Stirling numbers , whose definition is comparatively simple. In 
Chap. 3, we will prove the Clausen-von Staudt theorem which gives information 
on the denominators of Bernoulli numbers, and also Kummer’s congruence which 
gives a congruence relation between Bernoulli numbers. (This latter theorem is 
needed when we define p - adic zeta functions.) In Chap. 4, we define generalized 
Bernoulli numbers associated to Dirichlet characters. (Through this extension, 
the connection with quadratic fields and cyclotomic fields will appear.) Also the 
Bernoulli polynomials (a system of polynomials similar to the generating function 
of Bernoulli numbers) are defined, and as an application, we give a second proof 
of the formula of the sum of powers. The formula expressing the values of the 
Riemann zeta function at even positive integers by Bernoulli numbers is also given 
here. In Chap. 5, we give the Euler-Maclaurin summation formula, which is very 
useful to evaluate sum of values of functions at integers. In the coefficients of this 
formula, Bernoulli numbers or Bernoulli polynomials appear. As an application, the 
analytic continuation of the Riemann zeta function to the whole plane is proved 
and the values of the Riemann zeta function at non-positive integers are beautifully 
expressed by Bernoulli numbers. In Chap. 6, the relation between ideals of a 
quadratic field and quadratic forms is given and a relation between the generalized 
Bernoulli numbers and the class numbers of positive definite quadratic forms is 
explained. In Chap. 7, a congruence relation between the class number of imaginary 
quadratic fields and the Bernoulli numbers will be given. In Chap. 8, Gauss sums 
are introduced and then formulas to describe various sums of the roots of unity with 
characters by Gauss sums and generalized Bernoulli numbers are given. Here we 
get a third proof of the formula for sums of powers. In Chap. 9, L-functions will 
be introduced by modifying the usual definition of the Riemann zeta function by 
adjoining characters; then their functional equation is shown by using a contour 
integral expression. We will also give a formula to describe values at non-positive 
integers of L-functions in terms of generalized Bernoulli numbers. This formula 
gives a “raison d’etre” for the generalized Bernoulli numbers. In Chap. 10, among 
the zeta functions of prehomogeneous vector spaces, we pick an example which 
apparently does not seem so simple but can be shown to be a very easy function. 
(This is an example connected with the appearance of Bernoulli numbers in the 
dimension formula for modular forms.) In Chap. 1 1, we give an alternative proof of 
Kummer’s congruence by considering p-adic measures. (This has a deep connection 
with p - adic L-functions.) In Chap. 12, we review the fact that Bernoulli numbers 
appear in the Taylor expansion of the (co)tangent function, and ask what would 
emerge if we replace the (co)tangent with elliptic functions (a well-worn device 
to generalize phenomena on trigonometric functions), and explain the theory of 
Hurwitz numbers. In Chap. 13, we show the analytic continuation of the Barnes 
multiple zeta function, which is a generalization of the Hurwitz zeta function, and 
explain relations between their special values and Bernoulli polynomials. We also 
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show the functional equation of the double zeta functions in the same way as in 
Chap. 9. In Chap. 14, we define poly-Bernoulli numbers, which are obtained by 
replacing the usual generating function of Bernoulli numbers by polylogarithm 
functions. In an Appendix by Don Zagier, some “curious and exotic” identities for 
Bernoulli numbers are given. 

As explained above, Bernoulli numbers are related with a great many things. 
But Bernoulli numbers do not have so many of their own problems to be solved, 
except for a few very difficult problems. So, maybe there exist few mathematicians 
who study only Bernoulli numbers. Since the situation is like that, there are not 
many books similar to this one, and it was not so easy to have a perspective on 
the wide range of relations for Bernoulli numbers. Although the present authors 
are specialists of number theory, we dare not say we are specialists of Bernoulli 
numbers. This book consists partly of quotations from the papers by authors 
who occasionally encountered Bernoulli numbers from their different interests, 
and partly of various memoranda (namely these are observations which are not 
necessarily very original results worthy to be written as careful scientific papers, 
but rather lightly checked, interestingly interpreted, or calculated for fun and worth 
being taken note of so as not to be forgotten). And this kind of connection is 
indeed the usual common connection of mathematicians to the Bernoulli numbers. 
In this sense, we believe that this book is useful for any reader who is interested in 
mathematics. 

The original edition of this book was published in lapanese by Makino Publisher 
in Japan in 2001. We would like to give our sincere thanks to Professor Don B. 
Zagier for recommending us enthusiastically to write this English version based 
on that Japanese book, for recommending this book to Springer- Verlag to be 
published, and also for writing the nice Appendix. We would like to thank Springer- 
Verlag for publishing this book, in particular Dr. Joachim Heinze of Springer 
and Ms. Chino Hasebe and Mr. Masayuki Nakamura of Springer Tokyo for their 
valuable help for the arrangement, and Dr. Alexander Weisse for his help with TeX. 
We would also like to thank anonymous referees of the English version for a lot 
of valuable comments. We would like to give our deep thanks to Mr. Suenobu 
Makino, the owner of Makino Publishers, who originally suggested us the theme 
of Bernoulli numbers and recommended to write the book in Japanese. Without his 
recommendation, the Japanese version would not have appeared. Tsuneo Arakawa, 
one of the co-authors for the Japanese version, passed away on October 3, 2003 at 
the age of 54 before this English project started. Since some parts of this book are 
based on his Japanese manuscript, we would like to keep his name as one of the 
authors. The second- and the third-named authors express their profound regret and 
sadness over the early death of Tsuneo Arakawa. 


November 2013 


The Authors 
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Chapter 1 

Bernoulli Numbers 


1.1 Definitions: Introduction from History 


In his posthumous book Ars Conjectandi [16] published in 1713 (the law of large 
numbers in probability theory is stated in this book), Jakob Bernoulli 1 introduced 
the Bernoulli numbers in connection to the study of the sums of powers of 
consecutive integers \ k + 2 k + ■ ■ ■ + n k . After listing the formulas for the sums 
of powers 


E« 


n(n + 1) 
2 


E * 2 


n(n + 1)(2» + 1) 


E- ! = ( 

i = i v 


n(n + 1) 



up to k = 10 (Bernoulli expresses the right-hand side without factoring), he gives 
a general formula involving the numbers which are known today as Bernoulli 
numbers. Bernoulli then explains how these numbers are determined inductively, 
and emphasizes how his formula ((1.1) below) is useful for computing the sum 
of powers. He claims that he did not take “a half of a quarter of an hour” to 
compute the sum of tenth powers of 1 to 1,000, which he computed correctly as 
9 1 40992424 1 42424342424 1 924242500. 

Using modern notation, his formula is written as 


E f * 



B, 


„A'+W 

k + \- j 



'Bom on December 27, 1654 in Basel, Switzerland — died on August 16, 1705 in Basel, 
Switzerland. Jakob is the eldest among the mathematicians in the famous Bernoulli family. It 
is said that Jakob, his younger brother Johann (bom on July 27, 1667 in Basel, Switzerland — 
died on January 1, 1748 in Basel, Switzerland), and his second son, Daniel (bom on February 8, 
1700 in Groningen, Netherlands — died on March 17, 1782 in Basel, Switzerland) are the most 
distinguished among them. 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
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1 Bernoulli Numbers 


where (^) is the binomial coefficient 


/ k\ k(k — l)(fc — 2) ■ ■ ■ (k — j + 1) 

w ~ ^ 


and Bj is the number determined by the recurrence formula 



k + 1, k = 0, 1 , 2 ,.... 


It is this that is subsequently called a Bernoulli number, 2 and is the main theme 
of this book. 

Bernoulli does not give symbols for Bq and B\, and he writes A, B, C, D, . . . 
for B 2 n- As we will see later, Bi n +\ = 0 except for B\ . In his book, the right-hand 
side of (1.1) is written as 


1 r+l 1 f c t r _, c.c 
n +1 — n H — An 1 H 


l.c-2 


-5n c ~ 3 * + 


c + 1 2 2 

c.c — l.c — 2.c — 3.c — 4 


2 . 3 . 4 . 5. 6 


Cn c ~ 5 + 


2.3.4 

c.c — l.c — 2. c — 3.c — 4,c — 5.c — 6 


2.3.4.5.6.7.S 


-Dn 


c-1 


+ ' 


where c = k. 

We would like to mention here that, in the book called Katsuyo Sanpo (“essentials 
of the art of calculation”) by the outstanding Japanese mathematician Takakazu 
Seki, 3 published also posthumously, in 1712 (and thus 1 year before Bernoulli!), the 
formula for the sums of powers and the inductive definition of the Bernoulli numbers 
are given. His formula and definition are completely the same as Bernoulli’s. 


2 It was apparently de Moivre (Abraham, bom on May 26, 1667 in Vitry-le-Francois, Champagne, 
France — died on November 27, 1754 in London, England) who first called this number a Bernoulli 
number in the book Miscellanea analytica de seriebus et quadraturis (London, 1730). De Moivre 
is famous for de Moivre’s formula in trigonometry. 

3 Bom in 1642(?) in Kohzuke(?), Japan — died on October 24, 1708 in Edo, Japan. Fie is usually 
considered Japan’s greatest mathematician of the Edo period (1600-1857, when the country 
was closed to essentially all foreign contact). Flardly anything is known about his mathematical 
education, and he seems to have been largely self-taught. Fie served under the shoguns Tsunashige 
Tokugawa and Ienobu Tokugawa, occupying the post of Controller of the Treasury Office, and 

wrote several treatises in higher mathematics. Apart from discovering Bernoulli numbers simul- 
taneously with or before Bernoulli, he discovered determinants and the mles for calculating them 

simultaneously with or before Leibniz, solved extraordinarily difficult problems of elimination 
theory for systems of polynomial equations in many variables, and began a study of calculation 
procedures for the arc of a circle that was continued and completed by his disciple Katahiro Takebe 
with the discovery of infinite series expansions for various trigonometric functions. 
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Seki refers to Bo, B \ , Bi , . . . as Shu s an (which means “numbers to be taken”) of the 
first order, second order etc., labeling odd indexed i? 2 «+i as well. It is not widely 
known that Seki independently found the Bernoulli numbers, but the collected works 
of Seki [84] have been published and the volume contains an English translation 
of each article. A reproduction of Seki’s table giving the formula for the sums of 
powers in terms of binomial coefficients and “Seki-Bernoulli numbers”, together 
with a translation into modern notation, is given as Figs. 1.1 and 1.2. 

Neither Seki nor Bernoulli explains in much detail how to deduce the formula 
(1.1). Prior to their work, the formula for the sums of powers was discussed in 
Academia Algebrae by Faulhaber. 4 He obtained the result saying that, when k is 
odd, the quantity Yl"=\ A is a polynomial in 'iklAll = Y^"=\ i, (the first example 

being ^" =1 r 3 = (^" =1 i) 2 ), whereas when k is even, i k is divisible 

by " ( ''+ 1) 6 (2 "+ 1) = Yl'i = i i 2 as polynomials in n, and the quotient is again a 
polynomial in ^” =1 i. But apparently he did not reach the Bernoulli numbers. 
These facts discovered by Faulhaber were rediscovered by Jacobi 5 [52], who gave 
a rigorous proof. We shall provide a proof of the formula (1.1) in the next section. 
As for the results of Faulhaber, we only formulate them and do not give proofs, 
leaving them to the literature. We mention here that a very extensive bibliography 
on Bernoulli numbers, compiled by Karl Dilcher, is available online [28], 

Let us now give the definition of the Bernoulli numbers again. We follow Seki 
and Bernoulli, and define them using a recurrence formula. The Bernoulli numbers 
may also be defined by using a generating function. We will see in the next section 
that these definitions are equivalent. 

Definition 1.1 (Bernoulli numbers). Define B„ (n = 0, 1, 2, ... ) inductively by 
the formula 

.A (n + l\ 

. jBj = n + l (n = 0,1,2,...). (1.2) 

Let us compute B„ for small n . For n = 0 we have 

B 0 = 1. 

Putting n = 1 in the formula (1.2) we have 

Bq + 2B\ = 2, 

from which we obtain 


4 Johann Faulhaber (bom on May 5, 1580 in Ulm, Germany — died in 1635 in Ulm, Germany). 

5 Carl Gustav Jacob Jacobi (born on December 10, 1804 in Potsdam, Prussia (now Germany) — died 
on February 18, 1851 in Berlin, Germany). 
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Fig. 1.1 Katsuyou Sanpou, Seki’s tabular presentation of the power-sum formula, written not in 
Chinese characters but using the notation of Sangi or counting rods. The table (which is reproduced 
here with a 90° rotation) contains each of the ingredients of the formula in parentheses in Eq. (1.1): 
The number designated by "power” in the translation overleaf is the k of this formula, the binomial 
coefficients are tabulated in the “Pascal’s triangle” in the right part of the table (= left part in the 
translation), the Bernoulli numbers Bj are given on the left (the right in the English translation), 
the missing coefficient (j^J) in (1.1) corresponds to the crossed out "1” at the beginning of each 
column of the Pascal triangle, and the “denominator” is the number k + 1 by which the whole 
expression has to be divided at the end of the calculation 
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Fig. 1.2 Katsuyou Sanpou 
(English translation, reflected 
with respect to the original) 
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Putting n = 2 in (1.2) we have 

Bo + 3B\ + 3i?2 = 2, 

from which we obtain 

B 2 = 1(3-50-350= i 

3 o 

Similarly, putting n = 3 in (1.2), we have 

Bo + 45j + 652 + 4B 2 = 4, 


which gives 
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Table 1.1 Bernoulli numbers 


n 

0 1 

2 

3 

4 5 

6 

7 

8 9 

10 

li 

12 

13 

14 

15 

16 

17 

18 

B n 

1 ~2 

1 

0 

_ 30 0 

1 

0 

o 

H8 

1 

5 

0 

691 

0 

7 

0 

3617 

0 

43867 

6 

42 

66 

2730 

6 

510 

798 


Putting n = 4 in (1.2), we have 

B 0 + 5B l + 10 B 2 + 105 3 + 5B 4 = 5, 


which gives 


f? 4 = ^(5 -Bo- 5 B, - 10 B 2 - 10 B 3 ) = 

and so on. Table 1 . 1 is the list of Bernoulli numbers B„ up to n = 1 8. It follows from 
the definition that B„ is a rational number. Mathematical software packages such as 
Mathematica and Maple include the algorithm for computing Bernoulli numbers, 
and it is not hard to obtain several hundred B n ’s in a moment. 

Remark 1.2. There is another convention on Bernoulli numbers: B\ = — \ and 
everything else is the same. We adopt the first definition because this is the original 
definition of Seki and Bernoulli for one thing, and it is better suited to the special 
values of the Riemann zeta function for another, that is, the formula in Theorem 5.4 
is valid also for m = 1. Although the difference is minuscule, we call readers’ 
attention to it. 

Since B„ = 0 for every odd n greater than 1 (we prove this later), it is not difficult 
to translate formulas from one definition to the other by replacing B„ by (—1)" B n . 


1.2 Sums of Consecutive Powers of Integers 
and Theorem of Faulhaber 

In this section we first prove the Seki-Bernoulli formula (1.1) for the sum 

n 

Sk{n) := i k ( k.n integers, k > O.n > 1). 

i = 1 

Then we will state Faulhaber’s theorem and give a recurrence formula for Bernoulli 
numbers at the end of this section. 

Before going into the proof, let us give several explicit formulas for Sk(n): 


S 0 (n ) = n. 
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= -n(n + 1), 
n 3 n 2 n 

Sl{n) ~ y + y + 6 

= -n(n + 1 ) (2/7 + 1), 

6 

4 3 2 

n n n 

s 3 w = t + y + T 

= ^-n 2 (« + l) 2 , 

5 4 3 

« « n n 

W 5 2 3 30 

= — 72 ( 7 ? + l)(2n + 1)(3;; 2 + 3 h — 1), 

72 6 7? 5 5/7 4 72 2 

S 5 (77) = — + — + 

6 2 12 12 

= —n 2 {n + 1 ) 2 ( 2 7? 2 + 277 — 1), 

„ 77 7 7 7 6 7 7 5 77 2 77 

*S*6 ( 77 ) = — + — + — + 

7 2 2 6 42 


= — 72(?7 + 1 ) (2?? + 1 ) (3?? 4 + 6n 3 — 3?? + 1). 


For the proof, we first see immediately that 

So (77 ) = 72 . 

Let k > 1 . From the binomial theorem we obtain 

k 


(77? + l) i+1 - 777 i+1 = 


k + V 


\m J . 


;'= o 


Putting 7?? = 1,2 ,72, and summing over all 7??, we obtain 
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From this we have 


Skin) = 


1 

k + 1 


(77 + l) t+1 



Putting k = 1,2,... in this formula, we obtain 


77 1 77 77 3 77 2 77 

St(77)- -+-,S 2 (n) - 


By induction we see that 

Sk{n) is a polynomial of degree k + 1 in 77, whose leading term is n k+1 . 

k + 1 

Our objective is to write down this polynomial explicitly. 

Consider the polynomial S k (x)(= yy-j-x t+1 + •••) obtained by replacing 
the variable n by x. In general, two polynomials /(x),g(x) are identical if 
/ ( 77 ) = g(n ) for all positive integers n . By definition we have 

Skin + 1) - Sk(n) = (77 + l/' (77 = 1,2,3 ). 

Thus, we obtain 

S k (x+ l)-S k (x) = (x+l) k . 

Putting x = 0, and using S k (1) = 1, we have 

Si-(O) = 0. 

This yields the constant term of S k (x). The other coefficients may be obtained 
from the derivatives S k ( 0), 1 < j < k. Taking the derivative of the relation 
Sk(x + 1) — S k (x) = (x + l) k , we obtain 

S' k (x+l)-S' k (x) = k(x + l) k -\ (1.3) 

Putting x = 0, 1, 2, .... 77 — 1, and adding them all, we have 

S , k (n)-S' k (0) = kS k - i (n). 

This holds for any positive integer 77 . If we let S k ( 0) = b k (bo =1), then we have 
the relation 


S' k (x) = kS k - i(x) + b k . 
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Taking the derivatives of both sides, we have 


S'lix) = kS' k _y(x). 


(1.4) 


Putting x = 0, we have 


s;'(o) = kb k - 1. 


Taking the derivative of (1.4) once again, and using (1.4), we have 
Sf (x) = kS'Uix) = k(k - 1 )S' k _ 2 {x). 

Putting x = 0, we have 

Sf (0) = k(k - \)b k - 2 . 

Similarly, taking the derivatives successively, we obtain 

S ( k j) { 0) = k(k- 1) • • ■ (k - j + 2 )b k - J+1 (2 < j < k + 1). 

Finally, we have 

j= 0 J 


-IM'Y 

i ^/k + A 

= + j J 


■+ix ; (5f(0) = 0) 


bjx k+l - j . 


Since 5^(1) = 1, we obtain, by putting x = 1 in the above formula, the recurrence 
formula 


k + 1 



which is nothing but the recurrence for the Bernoulli numbers. We therefore 
conclude bj = Bj . In view of the identity 


1 

k + 1 



1 

k + 1 - j 



we obtain (1.1). 


□ 
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Remark 1.3. If we set S' k (x — 1) = B k (x) (B, k (x) = x k + •••), it follows from 
(1.3) and (1.4) that 


B k (x + 1) - B k (x ) = kx k \ B' k (x ) = kB k -\(x). 


This B k (x) is called the k th Bernoulli polynomial, which leads to various appli- 
cations. We will give its definition and some properties in Sect. 4.3 (p. 55). The 
formula for sums of consecutive powers may be obtained easily from the Bernoulli 
polynomial. We will show it in Sect. 4.3. 

Let us prove the fact that B k = 0 for odd k greater than 1 using the polynomial 
S k (x). We will prove this fact later more easily using the generating function, but 
we prove it here because we need to use it before that. 

Proposition 1.4. If n is an odd integer greater than or equal to 3, then B„ = 0. 
As a consequence, (— 1 )" B„ = B„ for all positive integers n except for n = 1. 

Proof. Supposed > 1. Putting x = — 1 in the formula S k (x+ 1) — S/Tx) = (x+l) A 
and using the fact S k ( 0) = 0, we obtain S k (— 1) = 0. Thus, by putting x = — 1 in 
the formula 



we obtain 



If we subtract this from the recurrence formula in the definition of Bernoulli 
numbers 



then only odd indexed terms remain and we have 



k—i 


where [x] stands for the greatest integer less than or equal to x. Since If = 1/2, the 
term for j = 0 in the left-hand side cancels with the right-hand side, and we obtain 
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1^1 


E 


k+ 1 ' 

i \ 2 ' + b 


B 2j +\ =0 (k > 3). 


Putting k = 3, 5, 7, . . . , we obtain = 0 inductively for all odd k > 3. 

Next, we state Faulhaber’s theorem. 

Theorem 1.5. Let k > 1 ant/ set 


□ 


u = 77(77 + 1) (= 2Si(»)), v = n(n + 1)(2« + 1) (= 6 S 2 («)). 
Then we have 


Sik+\{n) 


2 *-i 


2k + 2 


E4 


=0 


<777 r/ 


S2Jt(«) = 


V 

(2fc + l)(2fe + 2) 


£(* + 1 -i)Af ) u k -'-\ 

1=0 


where is a number determined by the formula 


<-i 


4 k) = i, 4 (t) = - 


k + 


- — y 


/ k + 1 - j 


j=o \ k + J 


■2 i 


. \A ( f ) (1 < i < k-l). 


{Fork + j — 2 i < 0, we put = 0-) 

For the proof we refer to [30] and [60]. We encourage readers to attempt to prove 
it by themselves. 

Before ending this section we prove an amusing recurrence formula for Bernoulli 
numbers using Sk(x) [55]. 

In the defining recurrence formula for Bernoulli numbers (1.2), if we move the 
right-hand side to the left, and use the fact that ( — 1 )' If = Bj for i / 1 and 
B i = 1/2, then we obtain 


= 0 (L5) 


for 77 > 1. Now, we define B„ = (77 + 1 )B n . Then, we have 
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Theorem 1.6. For n > l, we have 


£(-iyt n + ')S^, = o. 


Remark 1. 7. Though this formula is similar to ( 1 .5), it requires only half the number 
of terms to compute B 2n (and thus /??„)■ 

Proof From the binomial theorem we have 


c n+1 (i -x) n+1 = | l \ x n+i 


+1 


Taking the derivatives of both sides with respect to 1 , we have 

n + i / ,\ 

(« + l)(x”(l - x) n+l - x n+1 (l -x) n ) = £(-iy (" T j(« + i + l)x n+i . 


From this we have 

(n + l)x n (l-x) n (l-2x) = (— 1)" +1 (2 n + 2)x ln+x 

+ ^(-D^ n | M(n +i + l)x n+i . (1.6) 

Putting x = ( I + y ) / 2 in this formula, we have 


i.h. s . = -(» + i) (i±Z) ( V) 

= — 2 -2 ” (« + \)y(\-y 2 ) n 

= -2- 2 > + i)£(— iyQ/ i+1 - 

We put x = 1, 2, .... m in (1.6), add them up, and use the equations 


^_y 2l+1 = ^(2x- 1) 2,+1 = S 2 i+i(2tn) - 2 2,+1 S 2 i+i(m) 

X=\ X = 1 


to obtain 
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- 2 ~ 2 "(n + 1) (Jj (Sx+I&m) - 2 2i+1 S 2i+l (m)) 

= (-1)" +1 (2 n + 2 )S 2 „ +1 (m) + £(-1)^” | 1 j (n + / + 1 )S n+i (m). 

Since this holds for all positive integers m, we can use the same argument as before 
to conclude that the formula obtained by replacing m by x. 


- 2 ~ 2 > + 1) £(-l )■' (J j (S 2i+1 (2*) - 2 2,+1 S 2i+1 (.r)) 

= (— l)" +1 (2n + 2 )S 2 „+i(x) + ^^( — 1)' ^ ^ (n + i + 1)S„+, (x), 

holds as a formula for polynomials. We take the derivative of both sides of this 
with respect to x and set x = 0. The desired formula follows from the facts 
that S' k (0) = Bk (previous section), that the term for i = 0 in the left-hand side 
vanishes due to the fact 2S|(0) — 25, (0) = 0, and that /?/. = 0 for odd k > 3 
(Proposition 1.4). □ 
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We have defined Bernoulli numbers by a recurrence formula. However, it is also 
common to define Bernoulli numbers using the generating function 


te r 


e‘ - 1 


°° t n 

E b »~- 

z — ' n\ 

n=0 


(1.7) 


For the other definition of B n we mentioned in Remark 1.2 in Sect. 1.1, we need to 
replace the left-hand side by — t . In fact, the generating function is very 

useful for the study of various properties of Bernoulli numbers. 

In this section we will explain fundamental facts about formal power series for 
those who are not familiar with such objects. In the next section we will prove the 
above formula as Theorem 1.12. Those who are not familiar with abstract algebra 
are advised to take a glance at the next section and see how computations go before 
reading this section. 

Let R be a commutative integral domain with unit (written 1). (Although we 
write in this general fashion, readers may think of R as the rational number field 
Q for the time being. In this book R is either Q, or its finite extension, except 
for Chap. 11, where we need p - adic numbers. A commutative integral domain is 
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a commutative ring with the property that ab = 0 implies a = 0 or h = 0.) 
A formal sum 


oo 

^ ' Cl n t n = Gq + Cl\t + U2t~ -T Cl^t 2 ~\~ ' ' ' 
n = 0 


with an indeterminate t and coefficients in R is called a formal power series with 
coefficients in R, and the set of all such formal power series is denoted by R [[?]]. 
Two formal power series are defined to be equal if and only if all coefficients of t" 
coincide. Typical examples which will appear in this book are (taking R = Q) 


and 


, t t 2 t 2 

' = £^ = ' + 17 + 51 + 5; 




io g (i + f) = j](-ir- 1 

n = 1 


n 



These are Taylor expansions of complex functions e x and log( 1 + x) around x = 0, 
regarded as formal sums. We use the notation e\ log(l + t), but these are nothing 
but the formal power series of the right-hand sides, and we will not consider them 
as functions in t . 

The sum and the product of two formal power series are defined to be 

OO OO OO 

+ £>»*" = £>» + *»)*" 

n= 0 n= 0 /i=0 

and 

\i=0 ' \=o 

The product is defined using the distributive law formally, and the first few terms of 
it are 



(ao + a\t + ciot~ + + •••)• (bo + b\t + bit^ + bit 2 + • • • ) 

= a 0 b 0 + (aobi + a\b 0 )t + (a 0 ^2 + «i^i + fl2^o)f 2 
+ (ciobi + a\bi + ciib\ + aibo)t 3 + • • • . 

It is readily verified that /s’ [[/ ]] is a commutative ring with unit. The zero element 
and the unit are the formal power series 

0 + 0 • t + 0 • f 2 + 0 • r 3 -i — 
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and 


1 + 0 • t + 0 • t 2 + 0 • t 3 H , 


respectively. They are denoted simply by 0 and 1, respectively. The set of formal 
power series all of whose coefficients are 0 except for the constant term is naturally 
identified with R. Furthermore, regarding a polynomial P(t) with coefficients in 
R as a formal power series whose coefficients a n are 0 when n is greater than the 
degree of P(t), we can consider /s’ [r] as a subring of /?[[/]]. The definitions of the 
sum and the product of /?[[/]] are natural generalizations of those of polynomials. 

Since we assume that R is an integral domain, [[?]] is also an integral domain. 
Namely, we have 

Proposition 1.8. The ring !?[[/]] does not have a zero divisor (the product of 
two non-zero elements is again non-zero ). In particular, if A ^ 0 and AB = 
AC (A, B,C € R[[t]]), then B =C. 

Proof. Let A = ciq + a\t + cht 2 + a^t 3 + ■■■ and B = b 0 + b\t + bit 2 + b^t 3 + 

■ ■ ■ be non-zero elements, and let a^, b/ be the first non-zero coefficients. Then the 
coefficient Ck+i of t k+l in A ■ B = cq + C\t + C 2 t 2 + • • • is 


Ck+i — a o bk+t + ct\bk+i-\ + • • • + akbi + ■ ■ ■ + ak+i-\b\ + ak+ibo — cikbi- 


Since R is an integral domain, Ck+t = Okhi is not 0. Thus A ■ B is not 0. 

If AB = AC, then A(B — C) = 0, which implies ^4 = 0 or B — C = 0. Thus, if 
A ^ 0, then we have B = C . □ 

Proposition 1.9. A formal power series ti„t" is invertible in R [[/ ]] if and 
only if the constant term a o is invertible. 

Proof. If a „t" is invertible, i.e., if there exists a formal power series 

E~o suc h that (£,r=o a„t") ■ (Y2T=o but") = 1, then the constant term aobo 
of the product must be 1 , and thus ao is invertible. Conversely, suppose that ao is 
invertible. We define b„ e R (n > 0) by 

n 

bo = af l and b n = —af l ■ ^ a (n > 1), 

i — l 

and we compute the product ('Y2™=o a n t "') ' (Y2nLobnt n ). Then, we have aobo = 1 
and Y2'i=o a ibn-i = 0 for n > 1. This shows that bnC is the multiplicative 

inverse of Y2^=o a » l " ■ D 

We denote by ,4(0) the constant term of a formal power series A(t). As we 
mentioned before, a formal power series is not a function and we do not generally 
replace t by any special value. However, replacing t by 0 is always valid, and we 
use this notation exceptionally for this case. 
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Next, we show that we can define an infinite sum of elements of [[/]] under 
certain conditions. Let {Ak{t)}^L x , Ak(t) = Y^=o a f^' be an infinite family of 
formal power series. Suppose that these series satisfy the condition 

for each z, there exist only finitely many k such that a\ k) ^ 0. 

In other words, if we write Ak(t) = a^t" k + (higher-order terms) (a^j ^ 0), we 
have tik — > oo as k — > oo. Then we define A\ + A2 + A3 + ■ ■ ■ to be 

OO 

A\ 4- Ao + Aj + • • ■ = y ' a\t ' , 

;=0 


where 


00 



jfc=i 


In short, we take the sum of the coefficients in a common degree, and the above 
condition ensures that each sum is a finite sum. 

As a special case, we can define the “substitution” of a formal power series 
with vanishing constant term into another formal power series. Namely, if A(t ) = 
a it ‘ , and B(t ) = E^i bit 1 (B( 0) = bo = 0), then we can give a meaning 
to the expression 


A(B(t)) = ^>(S(0)'\ 

i =0 


because, from bo = 0, the polynomial a, ( B(t)Y begins at least in the degree i term, 
and the family {a, (B(t)) 1 }°^ () satisfies the above condition. As an easy example, 
we have 


= , + tn + 


1 ! 


2 ! 


+ 


(- 0 3 

3! 


+ ... 


, t t 1 L 

1 _ IT + 2! _ 3! + ' 


and 


log ( 1 - t ) = log( 1 + (-0) = 



(- 0 3 

3 



The identities 


e \og(l+t) = l+t 
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and 


log(l + (e'- 1 )) = t 

can be considered as the identities obtained by substituting one formal power series 
to the other. (We will give proofs of these identities after Remark 2.7 on p. 33.) 

We can also compute the reciprocal of a formal power series with constant term 1 
by “substitution”. First we remark that the reciprocal of 1 — t is 1 + / + t 2 + ■ ■ ■ : 

1 9 

= 1 + t + t 2 + ••• . 

1 - 1 


We can see this by computing the product (1 — t)(l + t + t 2 -\ ). Now, let 1 + 5(0 

be a formal power series with constant term 1. Then B(t ) is a formal power series 
without constant term (5(0) = 0), and thus the reciprocal of 1 + 5(0 can be 
computed by 


1 


1 


= l + (-5(0) + (-5(0) 2 + -- 


1 + 5(0 l-(-5(0) 

As an example, we compute the first few Bernoulli numbers using (1.7). Since 


e ’~' _ ' + 2! + 3! + 4i + 5! + " 




t r O r 

1 + — + — + ~ + “T77 + 

2 6 24 120 


we have 


7 v = i-( 

( 1 + i _|_ \L + ii _|_ _| ^ V 

( 

-( 

(! 


t_ 0 0 t*_ 

2 + 6" + 24 + 120 + " 


■) 


t t 2 t 3 
+ l 2 + 6 + 24 + 


T20 

4 


+ ■ 


t t 2 0 v 
2 + "6 + 24 + l20 + ’ 


■y 

■) J 


/3 


t H 


+ [t 2 + r ~6 + k + U0 + ' 


+ 


t t 

1 — — + — — + 0 • t 
2 12 


3 


? 4 

720 


+ 
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From this we have 


te 1 

e‘ - 1 


(■ 


fl + - + — + — -I L . . . 

^ 2 T 6 ^ 24 ^ 120 T ) 


• t z t i r 

— (l + ^H — “ + ~ + ~ "F ‘ 

1 2 6 24 

t t 2 , t 4 

= 1 + — + — + 0 • t — + • • 

2 12 720 


t t 2 _ 3 t 4 


1 — — + — + 0 • t 2 — A~ • ■ 

2 12 720 


Thus, we have Bq = 1 , B\ = j, Bi = \,Bi = 0, B^ = — 

In this “substitution” operation, the following proposition gives the “inverse”. 

Proposition 1.10. For a formal power series Ait) = a 0 + ci\t + ait 1 + ■■■, there 
exists a formal power series B(t ) such that 

Bit)) = 0, A(B(t)) = t 


if and only if 


ao = 0 and a \ is invertible. 

In this case B(t) is unique, and we have B{A(t)) = t. In other words A(t) and B(t ) 
are inverse to each other with respect to composition. 

Proof. If Bit ) = bit + bit 2 + ••• exists and satisfies A{B{t)) = t, then by 
comparing the constant terms and the degree 1 terms, we have ao = 0 and a \ b \ = 1. 
This shows that the condition is necessary. 

Conversely, suppose that A(t) satisfies a o = 0 and a \ is invertible. We would 
like to determine the coefficients of B{t) = b\t + bit 2 + bit 3 + ••• satisfying 
A(B(t)) = t. We first see from the coefficient of t that a\b\ = 1. Since a i is 
invertible, we put b\ = af l . For n > 2, the coefficient of t” in A{ Bit )) equals the 
coefficient of t" in 

a\(B(t)) + ai{B(t))~ + aiiBit)) 2 + ■ ■ ■ + a n [B[t)) n 

since there is no term t" after this due to the fact that Bit)) = 0. This can be 
written as 


a\b n + (polynomial in cii, a%,..., a n ,b\,bi b n - 1 ). 

If b\, bi, ■ ■ ■ , b n - 1 are already determined, then we can determine b„ uniquely from 
the fact that the above formula equals 0 and a\ is invertible. This proves the existence 
as well as the uniqueness of B(t). 



1.3 Formal Power Series 


19 


The B(t ) obtained in this way satisfies B(0) = 0, and b\ is invertible. We thus 
see that there is a C(t) (C(0) = 0) such that 

B(C(t)) = t. 

Then we substitute C(t) for t in the formula t = A(B(t )), and use the fact that 
B(C(t)) = t to get 

C(t ) = A(B(C(t))) = A(t). 


Thus, we have 


B(A(t)) = t. 


□ 


The formulas we mentioned earlier, 

e log(l +0 _l =t and log(1 + ( e l _ !)) = 


can be interpreted as saying log(l + t) and e‘ — 1 are inverse to each other with 
respect to composition. 

The derivative of formal power series Yff=o a nt'\ written jj (^,^L 0 a„t n ) 
(or (£~ 0 a„t"Y), is defined formally by term-by-term differentiation: 

d \ ~ _ 

— I 2_^ a nt I = 2_, na nt = a l + 2 a 2 1 + 3ait- + 4 ( 24 ? + • • • . 

\n=0 / n = 1 

For example, ( e')' = e‘ , and (log(l + t))' = 1 — t + t 2 — t 3 + ■ ■ ■ = y^-. 
This definition satisfies the usual rule of derivatives regarding sum and product. 
We give a proof of the product rule (f(t)g(t))' = f'(t)g{t) + f(t)g'(t). Let 
fit) = T,T=O a nt "- and g(t) = YffLo b nt n - Then 


oo / n 


00 / n 


(f(t)g(t)y = ( E ( E a ‘ b >- n = E ( n E a ‘ h -' i r 

/ / n = 1 V i= 0 


n — 1 


\n = 0 \i= 0 
oo / n 


= idibn-i + - i)aib n -i J t n 1 

n = 1 \i=0 i= 0 / 

oo / n \ oo / n — 1 

= ^2 ( E ia fn~i ) f" _1 + ^2 ( E a, '(” _ i ' )b " 


+n — 1 


= 1 \i = 1 


= 1 \i=0 


= /'(Of (o + m g \t). 
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Also, when R D Q, the integral 
integration: 


ft °° 

I y'' a n t" dt is defined formally by term-by-term 

Jo „ 

n= 0 


ft 00 00 ^ + 1 ^3 ^4 

/ V a„f" dt = y a„ = a 0 t + a x — + a 2 — + « 3 — + 

Jo y; y n + 1 2 3 4 

77 = 0 77 = 0 


In order to treat the generating function of Bernoulli numbers, it is convenient 
to generalize formal power series to Laurent 6 series (with a finite number of terms 
with negative powers). 

Definition 1.11. The set of formal Laurent series with coefficients in R, 

OO 

y «;?' (for some integer A), 

i=—N 


is denoted by R ((f)). 

The sum and the product in R((t)) are defined in the same way as in /<■[[/]] and with 
these operations R((t)) is a commutative integral domain that contains A 3 [[/]] as a 
subdomain. Also, _ v a, f' is invertible in R((t)) if and only if the first non-zero 
coefficient <a_ y is invertible. These can be proved in the same way as in the proof of 
Propositions 1.8 and 1.9. In particular, if R is a field, then so is R((t)). Furthermore, 
the formal derivative and integral (when R D Q) are also defined, and the quotient 
rule ( f(t)/g(t ))' = (f(t)g(t) — f(t)g'(t))/g(t) 2 holds. This follows from the 
product rule. (Put f(t)/g(t ) = lr(t), and take the derivative of /(f) = g(t)h{t).) 


1.4 The Generating Function of Bernoulli Numbers 


Theorem 1.12. Let B„ (n = 0, 1, 2, . . . ) be the Bernoulli numbers. We have the 
following formula in Q((f)).' 


te' 


e< - 1 


^ fll 

E b »~- 

z — ' n\ 

n=o 


As a matter of fact, the right-hand side belongs to Q[[f]], but we consider the 
quotient of the formal power series te 1 by e* — 1 in Q((f )). 


6 Pierre Alphonse Laurent (born on July 18, 1813 in Paris, France — died on September 2, 1854 in 
Paris, France). 
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Proof. It is sufficient to show (Yl^Lo Bn ^y)(e f — 1) = te 1 . The formula in question 
is obtained by dividing both sides by e 1 — 1 . From the definition of the product of 
formal power series we have 



( e * — 1 ) = 




oo /n — 1 

= E E 

n= 1 \/=0 


Bi 


1 


i ! (n — i )! 


t 


n 


OO / 72 — 1 

=eeC 

72 = 1 \/=0 



From the recurrence formula (1.2) we have Yl'i=o (”)Bi = n for all n > 1. Thus 
we have 


n=l \i=0 

which concludes the proof. 




□ 


Remark 1.13. Conversely, if we define B„ by the formula in the above theorem, 
then we have 


t n 




\72=0 


Expanding the left-hand side as in the above proof, and comparing it with the right- 
hand side, we obtain the recurrence formula (1.2). This shows that Definition 1.1 
and the definition using the generating function in Theorem 1.12 are equivalent. 

Using the generating function, we can give a simpler proof of Proposition 1 .4. 

Proposition 1.14 (Proposition 1.4 revisited). If n is an odd integer greater than 
or equal to 3, then B lt = 0. 

Proof. It suffices to show that the formal power series ^ does not have any 

odd-degree terms. Since we have 

te’ t t(e‘ — 1 + 1 ) t t t 
e< - 1 ~ 2 ~ e< - 1 2~ e 1 - 1 + 2 

and 

(— t)e~ f (—t) —tttt 

- — — - = h - = b 

e-'-l 2 1 — e' 2 e< - 1 2 
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4^r — 4 is invariant under the substitution t — > — t . This shows that the coefficients 
of odd-degree terms are all 0. □ 

Looking at B n for even n in Table 1.1, we see that they are non-zero and, from 
Bo on, the signs alternate. This may be seen from the formula for the value of the 
Riemann zeta function %(s) = A- at positive even integers (replace k by 2k 

on p. 61) (4.4): 


£(2 k ) = 


(-1)*- 1 (2jt) 2k p 

2 (2k) l Blk 


(k > 1). 


But here, we give a direct proof using the following recurrence formula for the 
Bernoulli numbers due to Euler 7 [33]. 

Proposition 1.15. 


n — 1 

(In + 

m=l 



Bom B2(n—m) 


(n > 2). 


Proof. By the previous proposition, if we subtract the term of degree 1 from the 
generating function of the Bernoulli numbers, we obtain the generating function of 
even indexed B 2n '■ 


te 1 t 
e‘ - 1 ~ 2 


r. ^ 2 " 


f2n 


( 2 «)!' 


Let f(t ) be the left-hand side. Taking the derivative of f(t ) as was explained on 
p. 19, we have 


Substituting Jf,T=o B 2 « for f(t), we have 


^(l-2n)fi 2B 

n=0 


(2 n)\ 


oo / n 

EE 



B 2m 


(2n)l 


~4' 


7 Leonhard Euler (Bom on April 15, 1707 in Basel, Switzerland — died on September 18, 1783 in 
St. Petersburg, Russia). 
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Comparing the coefficients of both sides, we have 


( 1-2 n)B ln = 

m= 0 



^2m ^2(n—mj 


for n > 2. Since the terms for m = 0 and m = n yield 2Bi„, we subtract them from 
both sides. Multiplying by —1, we obtain the formula in the proposition. □ 

Corollary 1.16. Forn > 1, we have (—\) n ~ x Bin > 0. 

Proof. We prove this by induction. For n = 1, Bi = g > 0. Suppose that the 
proposition is true for all integers less than n. By multiplying the formula in the 
proposition by (— l)" -1 , we get 


( 2 n + 1 ) - (-1 ) n - l B 2n = B 2m (-\) n - m - x B 2(n - m) . 

The right-hand side is positive by assumption. Thus, (— \) n ~ l B 2 ,, > 0, which settles 
the proof. □ 

To conclude this chapter, we give expansions of tan* and cot x as alternative 
generating functions for Bernoulli numbers. 

Proposition 1.17. The Taylor expansion oft an x and the Laurent expansion of cot x 
around x = 0 are 


tanx = 


OO 

£(- l) n ~ 1 (2 2n 

n = 1 


- 1)2 2 "B 2 „ 


x 2n-l 

7277 ) 7 ’ 


cotx = 


1 


+ 1 )" 2 2 " Blf, 

n = 1 


X 2 "- 1 

(2 ny.' 


The right-hand sides converge for \x\ < f and 0 < \x\ < n. respectively. 

Proof. The formal power series f(t) in Proposition 1.15 can be written as f(t) = 
j coth(j), where coth t = is the hyperbolic cotangent. Replacing t by x, we 

have 





B 211 


x 2n 

(2 nV-’ 


which is regarded as the Laurent expansion of coth(f). Replacing x by 2 ix (i = 
V— 1), dividing both sides by x, and using the fact i coth(ix) = cotx, we obtain 
the Laurent expansion of cotx. From the duplication formula for cotx which reads 
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we have 


tanx = cotx — 2cot(2x). 

This shows that the Taylor expansion of tan x can be obtained from the expansion of 
cotx. As for the radii of convergence, we see it from the fact that the poles nearest 
to 0 are located at ±7 t/ 2 and ±jt, respectively. □ 

Remark 1.18. The coefficient of in the Taylor expansion of tan x, 

T n := (— 1)' I_1 (2 2 " - 1)2 2 " — , 

2n 

is sometimes called the tangent number. It is a positive integer. The fact that it is 
positive is seen from Corollary 1.16. We will prove that it is an integer in Remark 7.6 
on p. 98. Knuth and Buckholtz [61] give another proof of this fact and a method to 
compute Bernoulli numbers using T n . See also Exercise 3.6 in Chap. 3. 

Exercise 1.19. Compute the Bernoulli numbers B n up to n = 12 by using 
Theorem 1.6. 

Exercise 1.20. Guess any pattern of prime numbers appearing in denominators of 
Bernoulli numbers. (This is one of the topics covered in Chap. 3.) 

Exercise 1.21. Compute the first several terms of the formal power series e lllg(l+,) 
and log(l + (e' — 1)) using the definition of composition. 

Exercise 1.22. Prove the quotient rule 

if it)/ gif))' = ( f(t)g(t ) - f(t)g\t))/g{t) 2 

of formal Laurant series. 

Exercise 1.23. Compute first several tangent numbers T n , and check that they are 
positive integers. 


Chapter 2 

Stirling Numbers and Bernoulli Numbers 


In this chapter we give a formula that describes Bernoulli numbers in terms of 
Stirling numbers. This formula will be used to prove a theorem of Clausen and 
von Staudt in the next chapter. As an application of this formula, we also introduce 
an interesting algorithm to compute Bernoulli numbers. 

We first summarize the facts on Stirling numbers. Those Stirling numbers we 
need in this chapter are of the second kind, but we also introduce Stirling numbers 
of the first kind, which will be needed later. 1 


2.1 Stirling Numbers 

We first define the Stirling numbers of the second kind, restricting ourselves to the 
case where a combinatorial meaning can be given. We adopt Knuth’s notation [59]. 

Definition 2.1 (Stirling numbers of the second kind (Stirling’s subset 
numbers)). 

For positive integers n and m , define 

n ] 

> := the number of ways to divide a set of n elements into m 

ml 

' nonempty sets. 

(Knuth proposes to read this as “n subset m”.) 


'According to Knuth [59], Stirling (James, bom in May, 1692 in Garden, Scotland — died on 
December 5, 1770 in Edinburgh, Scotland) first introduced the second kind. The names “first kind" 
and “second kind” are due to Nielsen (Niels, bom on December 2, 1865 in Orslev, Denmark — died 
on September 16, 1931 in Copenhagen, Denmark), who first used these names in his book on the 
Gamma function [73, §26]. 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_2, © Springer Japan 2014 
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For example, there are seven ways to divide the set {1,2, 3,4} into two non- 
empty sets: 

{1} U {2, 3, 4}, {2} U {1,3, 4}, {3} U {1,2, 4}, {4} U {1,2, 3}, 

{1, 2} U {3, 4}, {1, 3} U {2, 4}, {1,4}U{2,3}. 

Thus, {j} = 7. 

From the definition, {"} = 0 if m > n. Also, from the definition we have the 
recurrence formula: 


n + 1 j 

m j 


n 

m — 1 



( 2 . 1 ) 


This formula can be proved in the same way as in the combinatorial proof of the 
relation of binomial coefficients: ("+') = + (,”)■ Suppose we divide a set 

of n + 1 elements into m sets. We look at one particular element. If this element 
forms a set by itself, there are { ways to divide the remaining n elements into 
m — 1 sets. If this element belongs to a set with other elements, there are { " n } ways 
to divide other n elements into m sets, and there are m ways to put this particular 
element into one of these sets, and thus there are m { " m ) ways altogether. 

We now proceed to define { " | by the recurrence formula (2.1) for any integers 
m and n . 


Definition 2.2 (Stirling numbers of the second kind (general case)). For any 

77 1 

integers 77 and 777 , define > by the recurrence formula (2.1) with the initial 

m \ 

01 {77 1 10 


conditions 


1, and 


0 ( 77 , m 7 ^ 0 ). 


From (2.1), if we know two out of three values {"+ 1 }, { and \" m \, we can 
determine the third except for the case m = 0, where we cannot divide m J " } by m. 
In this case the value {” J = 0 is already given in the definition. We give the values of 
{^} for —7 < 777, 77 < 7 in Table 2.1 on p. 27. It is easy to see how all values of {" } 
are determined from the initial conditions, considering how the positions (77 + 1 , 777 ), 
( 77, 777 — 1), and ( 77 , 777 ) are located in the table. 

The new definition of {^) coincides with the previous combinatorial definition 
when 77,777 > 0. To see this, it suffices to verify {"} = 1 (77 > 1) and { ’ } = 
0 (777 > 2) under the new definition. Since these values are obvious under the old 
definition, and the recurrence formulas are the same for both definitions, the claim 
follows. We put m = 1 in (2.1) to obtain 


72 + 1 

1 


+ 


2. 1 Stirling Numbers 


27 


Table 2.1 


n 

m 


m\n 

-7 

-6 

-5 

-4 

-3 

-2 

-1 

0 

1 

2 

3 

4 

5 

6 

7 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

6 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

21 

5 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

15 

140 

4 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

10 

65 

350 

3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

6 

25 

90 

301 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

3 

7 

15 

31 

63 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

1 

1 

1 

1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

-2 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

-3 

0 

0 

0 

0 

1 

3 

2 

0 

0 

0 

0 

0 

0 

0 

0 

-4 

0 

0 

0 

1 

6 

11 

6 

0 

0 

0 

0 

0 

0 

0 

0 

-5 

0 

0 

1 

10 

35 

50 

24 

0 

0 

0 

0 

0 

0 

0 

0 

-6 

0 

1 

15 

85 

225 

274 

120 

0 

0 

0 

0 

0 

0 

0 

0 

-7 

1 

21 

175 

735 

1624 

1764 

720 

0 

0 

0 

0 

0 

0 

0 

0 


Putting n = 0 in this formula, we obtain {]} = 1. Also, since { " } = 0, we have 
{"T'l = {"}> anc * ^us {"} = 1 (n > 1). Similarly, {^} = 0 {m > 2) can be verified 
by putting n = 0 in (2.1). 

Remark 2.3. Logically, we only need Definition 2.2. However, in order to make the 
meaning clearer, we began with Definition 2. 1 . 

We now define the Stirling numbers of the first kind. 

Definition 2.4 (Stirling numbers of the first kind (Stirling’s cycle numbers)). 

For positive integers n and m, define 

n 1 

:= number of permutations of n letters (elements of the sym- 

TYl 

L J metric group of degree n ) that consist of m disjoint cycles. 

(We read “n cycle m”.) 

An element of the symmetric group of degree n (the group of all permutations 
(bijections) of the set {1, 2, . . . , n}) can be decomposed uniquely into the product 
of disjoint cycles. (See any textbook of group theory, for example [42, Chap. I, 
Th. 6.3].) Among the permutations of n letters, we count the number of permutations 
that decompose into m disjoint cycles (including the cycles of length 1), and that 
is [ ^ ] . For example, the permutations of four letters that decompose into two 
disjoint cycles are 
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(1) (2 3 4), ( 1 ) (2 4 3), (2)(1 3 4), (2)(14 3), 

(3)(1 2 4), (3)(1 42), (4)(1 2 3), (4)(1 3 2), 

(1 2) (3 4), (1 3) (2 4), (1 4)(2 3). 

Thus, = li- 
lt follows immediately from the definition that ] = 1 (only the identity), [" ] = 
( n — 1)! (number of n -cycles). 

The Stirling numbers of the first kind satisfy the recurrence formula 


77+1 

777 


77 

777 — 1 


+ 77 


77 

777 


( 2 . 2 ) 


We can see this in the same way as before. A permutation of 77 + 1 letters may fix 
the letter 77 + 1 . In this case the letter n + 1 forms a cycle by itself, and there are 
[ — - 1 ] permutations of the remaining n letters that decompose into 777 — 1 disjoint 
cycles. A permutation that moves the letter 77 + 1 can be obtained by inserting the 
letter 77 + 1 in one of the m disjoint cycles in the decomposition of permutations 
of 77 letters. There are j ways to form a cycle of length j + 1 out of a cycle of 
length j by inserting another letter. Therefore, there are n ways to insert the letter 
77 + I, and thus there are in total n [ ] permutations that move the letter 77 + 1. This 
shows (2.2). 

Just as we did for the Stirling numbers of the second kind, we define [^] for any 
777 and 77 as follows. 


Definition 2.5 (Stirling numbers of the first kind (general case)). For any 

r 77 1 

integers n and 777 , define by the recurrence formula (2.2) with the initial 

77 ? J 

"01 . . r«i r 0 


conditions 


0 


= 1 , and 


= 0 ( 77, 777 ^ 0 ). 


Table 2.2 shows the values of [^] in the same range as Table 2.1. Readers will 
certainly notice the remarkable relations between these two tables. As Knuth asserts, 
there is only one kind of Stirling number, and the recurrence formulas (2. 1 ) and (2.2) 
are essentially the same formula (see the next proposition). 

We list some formulas which Stirling numbers satisfy. There are more formulas, 
but we only list the ones we need later in this book. For others, see [38] for example. 


Proposition 2.6. 


" n " 


—ml 

in _ 


i-4 


n 



m = 0 


( 2 ) 


m 
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Table 2.2 

-mi 


m\n 

-7 

-6 

-5 

-4 

-3 

-2 

-1 

■a 

1 

2 

3 

4 

5 

6 

7 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

6 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

21 

5 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

15 

175 

4 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

10 

85 

735 

3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

6 

35 

225 

1624 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

3 

11 

50 

274 

1764 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

2 

6 

24 

120 

720 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

-2 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

-3 

0 

0 

0 

0 

1 

3 

1 

0 

0 

0 

0 

0 

0 

0 

0 

-4 

0 

0 

0 

1 

6 

7 

1 

0 

0 

0 

0 

0 

0 

0 

0 

-5 

0 

0 

1 

10 

25 

15 

1 

0 

0 

0 

0 

0 

0 

0 

0 

-6 

0 

1 

15 

65 

90 

31 

1 

0 

0 

0 

0 

0 

0 

0 

0 

-7 

1 

21 

140 

350 

301 

63 

1 

0 

0 

0 

0 

0 

0 

0 

0 


where x— stands for 

x— = x(x — 1 ) (x — 2) ■ ■ ■ (x — m + 1) ( m > 0), x- = 1. 


(3) 


(-ir 

m = 0 


n 

m 



(4) 




(n > 1). 


(5.1) 


Form , n > 0, 





(-1 ) m Sm,„ 


(5.2) 


For m, n > 0, 


/>o L J ' 


(-1 ) m 8 m , n 


Flere , 8 m%n stands for Kronecker’s delta; i.e., 8 mn = l form = n and 8 mM = Ofor 
m n. The sums on the left-hand sides of { 5.1) and (5.2) are finite sums. 
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( 6 ) 


For m, n > 0, 



(-D" 


ml 


E(-d 


/=0 



(7) 


(e r - l) m 
m\ 



t" 

— (m > 0). 
n\ 


( 8 ) 


(1 — f)(l — 2t) ■ ■ ■ (1 — mi) 



(m > 1). 


(9) 


(-logd - or 

m\ 


oo 



n=m 


t" 

— (m > 0). 

n\ 


Proof. (1) Write a n m = {_”'}• It suffices to verify that a„ m satisfies the same 
recurrence formula with the same initial conditions as those for [ ” ]. It is easy 
for the initial conditions. In the recurrence formula (2.1), replacing m by —n 
and n by —m , we have 


—m + 1 
— n 



Adding n { } to both sides, we have 


—m + 1 
—n 




This implies a njn - \ + na tum = a n + i, m , which is nothing but the recurrence 
formula (2.2). 

(2) We use the same strategy. Namely, write x" = Y^m=Q a n.mX—~ and verify that 
a n m satisfies the same recurrence formula as { " n | within the range m, n > 0 
and the same initial conditions. It is easy for the initial conditions (put ao Jn = 0 
(m > 0)). From x'-^- = x—(x — m ) and x--x = x'-^- + mx—, we have 


72 + 1 

y, a„+ i, m x- = x n+x = x" ■ x 

777=0 


72 

y a nm x--x 

772=0 
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= a n,m(x tn -^- + mX~) 

777=0 


77 — I — 1 n 

^ ' @n,m—\X b ^ ' Wl&n,niX • 
m=l in— 0 


Comparing the coefficients of x— of both sides, we obtain the recurrence 
formula (2.1). (Here, we put a„-\ = a, un +\ = 0.) 

(3) is similarly proved and we omit it. 

(4) We prove it by induction. For n = 1, both sides equal x X. Suppose the formula 
is true for n . Then 



(To prove the next-to-last equality, we use {„+j} = 0 and { '' | = 0.) This shows 
that the formula is true for n + 1 . 

(5) We prove it using (2) and (3). For (5.1), we put the formula 

1 

*-=(- D'EC-i)' 

777 = 0 

obtained by replacing n by / in (3) into the formula 
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which is obtained by replacing m by / in (2). Then we have 


l=o 




-l) m 

" / " 


_ m _ 

n \ 

" / " 


_m _ 


Comparing the coefficients on both sides, we obtain (5.1). The formula (5.2) 
can be obtained by substituting (2) into (3). 

(6) We prove it by verifying that the right-hand side satisfies the recurrence formula 
for \ n m \- Denote the right-hand side by a nm once again. It is easy to see that 
a o,o = 1 and a„ ,o = 0 for n > 1. (Define 0° = 1.) If n = 0, and m > 1, then 
Yl'"=o(~ l) 7 ( 7 ) = (1 — l) m = 0- Thus, we have ao.m = 0. As for the recurrence 
formula, we verify it as follows: 


Wd, l m T @n,m~ 1 


(- 1 )” 




m— 1 


Hr 


m\ I m — V 


1=0 


r 


m — 1 

L / , 


(m — 1)! ' m \ l 

’ i=o \ 


(- 1 ) 


fEh/ ; - 


1=0 


U>! + 1 ,n 


One can prove (6) more naturally using the difference calculus. See for example 
[53, §58]. 

(7) First, note that the right-hand side of (7) can be expressed as the sum from 
n = 0. (If n < m, then { } = 0.) Substitute (6) into the right-hand side and 
simplify: 




r.h.s = 


n = 0 1 

Hf 

m\ 


mi 


B-»' 


/ =o 


I m 
1 / , 


l n \~ 

I n\ 


p-'A'WJ 
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(- 1 )" 


ml 




1=0 


(- 1 ) 


m 


ml 
(e' - l) m 


a-ey 


= l.h.s. 


It can also be proved by writing (e' — \) m / ml = Yl™=m a n,m and verifying 
that a n m satisfies the same recurrence formula as { ” | . 

(8) Denote the right-hand side by From the recurrence formula (2.1) for { " J , 
we have (noting that {" ) = 0 for;? < ;;?) 


/» = £ 



t 


n 



+ m 


n — 1 
m 


I) 


t 


n 


oo » oo , 

= f T n 1 f' + mtT n \t 
' m — 1 1 ' m 

n = in — 1 ' n=m ' 


n=m— 1 
tfm— 1 Wltfm* 


Thus, f m = fm—\ ■ From {"} = 1 (n > 1), we have f\ = ~y, which shows 
that f m is equal to the left-hand side of (8). 

(9) Writing (— log( 1 — t )) m / m ! = Y^= m a n.m yyr, and taking the derivatives of both 
sides, we have 


(-log(l-r))'"- 1 1 

(;;? — 1)! 1 — t 


OO 

^ ' ®n,m 
n=m 


t n ~ l 
( n — 1)! 


From this we have 


00 t n 00 t n - 1 

X! = f 1 _ f ) XI a ”- m i n _ mi • 

n=m — 1 ’ n=m ' 

This shows that a„ m satisfies the same recurrence formula as [ ; "J. It is easy to 
verify the initial conditions. □ 

Remark 2. 7. Sometimes the Stirling numbers are defined by the formulas (2) and 
(3) in this proposition. 
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Using (5.1) and (7) in this proposition, we give a proof of the formulas that were 
left unproved in the previous chapter (p. 17). We give a proof of 

log ( 1 + (e 1 - 1)) = t. 

Once this is proved, the converse 


e iog(i+r) _ i = t 

follows from Proposition 1.10. (Of course, we can prove it similarly using (9) in this 
proposition.) 


log(l + ( e ' - 1)) = JViy"- 1 ^ 

z — ' m 

m= 1 

o° oo , , 

= Enr 1 ^- 1 )!! " - 

m n 

in = ] n=(l ' ' 


m = 1 

OO / OO 


EEm - 1 

n=0 \m — 1 


71 })^ ((m- 1)! = 

m\ J n\ \ 


= E(-i 

z — ' n\ 


2.2 Formulas for the Bernoulli Numbers Involving 
the Stirling Numbers 

A formula expressing the Bernoulli numbers as a finite sum involving the Stirling 
numbers of the second kind has been known at least since Kronecker 2 [62], (See 
also an account of von Staudt in Sect. 3.3.) By Proposition 2.6 (6), the Stirling 
numbers can be expressed as a finite sum involving only the binomial coefficients. 
Therefore, the Bernoulli numbers can be expressed as an elementary (i.e., involving 
only binomial coefficients and simple polynomials) double sum. Gould [36] studied 
various historical sources for analogous formulas, and he concluded that, at least 
until around 1970, the very fact that there are closed finite formulas (without 
involving infinite sums and integrals) for the Bernoulli numbers was not widely 
known. He also conjectures that there is no “elementary” (in the above sense) 


2 Leopold Kronecker (bom on December 7, 1823 in Liegnitz, Pmssia (now Legnica, Poland) — died 
on December 29, 1891 in Berlin, Germany). 
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formula for the Bernoulli numbers expressed as a finite single sum. As far as we 
know, such a formula has not been discovered. 

We prove the following formula. 

Theorem 2.8. 


B 


n 


(-i >"£ 

m= 0 


m + 1 


(n > 0). 


Proof. Rewriting the generating function of B„ as 

te‘ t — log (1 — (1 — e - ')) 

e 1 — 1 1 — 1 — e~ l 

and substituting 1 — e~‘ (= t + ■ ■ ■ ) for t in — log(l — t) = — , we have 

y Bn -= 

" n ! 


= E 


1 - e~' 

(1 - e ~T -1 


= E 




m + 1 


y-v (-l) m 7«! y- (») 

m + 1 m I 

vn —C\ n = m ' 


n 1 (— t) n 


n\ 


(Proposition 2.6 (7)) 


= £<-»" £ 


n=0 


\m = 0 


( -irm!{;} \ p- 

m + 1 / n\ 


Comparing the coefficients of — on both sides, we obtain the formula. 




Remark 2.9. By Proposition 2.6 (6), we can also write 
B„ 


’■ = (-.)>£^t£<-.)'(7)<". 


Proposition 2.10 (A variation of Theorem 2.8). 

B „ = T 


m= 0 


m + 1 


(n > 0). 
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Proof. We can prove this by computing the generating function on the right-hand 
side. Here, we use Theorem 2.8. From the recurrence formula (2.1) for the Stirling 
numbers we have {"+*} = {'' } + (m + l){ m + I }, and thus 

(— l)'”?n! ( { ”, } + (m + 1 ){,„+]}) 

m + 1 


r.h. s . = e 

m = 0 


On the other hand, since 


I” m- f-1 

L 1 


y^ ( !)"'?«!(»;+ l){ m+1 } _ y^ 

^ m + 1 ~ 2^ [ > 


ml, it follows from Proposition 2.6 (5.1) that 

= Si„. 


ra=0 m = 0 

Hence, the right-hand side of the proposition equals 


m + 1 
1 


n 

m + 1 


" (— l) m m!{"| 

E ... . , + 8 '-» = (- 1 )" 5 " + 8 '-»- 


m= 0 


m + 1 


(Theorem 2.8.) For n = 1, this formula equals —B\ + 1 = \ = B\. For n f 1, it 
equals (— 1 )" B n , which coincides with B„ due to Proposition 1.4 on p. 10. □ 

This proposition can be interpreted as the following algorithm, which first 
appeared in the study of multiple zeta values at non-positive integers by S. Akiyama 

and Y. Tanigawa [2]. First, define the 0th row by 1, i, (We count the 

number starting from 0 instead of 1 for convenience.) Then, we define the first row 

by 1 • (1 — 5 ), 2 • (| — |), 3 • (i — ^), In general, if we denote by a, um the 

/ 77 th entry of the nth row, define the /nth entry of the ( n + l)st row a n +\ m by 
(m + 1) ■ (a„ m — a n m +i). In other words, we take the difference of the adjacent 
entries and multiply by (position of the entry +1), and place them in the next row. 
Then, the leftmost entries of the rows, 1, |, |, 0, — , are nothing but the 

Bernoulli numbers. 

Taking Proposition 2.10 into account, it suffices to show the following proposi- 
tion to prove this fact. 

Proposition 2.11. Given a.Q m (m = 0, 1, 2 ), define a„ (n > 1) by 


a„, m = (m + 1) ■ - a n - t, m +i) (n>l,m> 0). 


^ 72,0 


^(-l)'"m! 


m= 0 


Then, 


n + 1 
m + 1 


@Q,m ■ 
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Proof. We use the generating function 


&n(f) — ^ ' &n,mt • 

m = 0 


By the recurrence formula for a n , m , we have for n > 1 


5 n (0 — ^ ^ l,m ^72— l,m+l)^ 


727=0 


, oo , oo 

= a n - hm t m+1 ) - — ( ^ a„_ lim+1 ? m+1 ) 


dt 


d d 

= -J t (tgn-l(t))~ —(g„-l(t)-a n - h o) 
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= gn-l(t) + (t~ — )) 

at 

= - i)g„-i(t)). 

Thus, if we put ( t — 1 )g„(t) = we have 


h„(t ) = (t - 1) — (/t„-i(0) (« > 1). 

at 


From this we obtain 


h n (t) 


= (.<- 


I <«»■ 


It follows from Proposition 2.6 (4) (with x replaced by t — 1 ) that 


hn(t) = £ 
m = 0 

Putting t = 0, we have 

-l)" I m!(a 0 ,m-i -«o,m) 


(0 


(i-ir hrr *o(o 


-a«,0 = n l* -1 

= X) 1 ,)(- 1 )" !+1 0« + - X! I” 1 (_1 

m + 1 m 

777 = 0 1 ~=n J 

" / 

= - ^(-l) m m!ao, m l(m + 

777=0 ' 

n 


0 


n 

m + 1 


:)) 


m=0 


n + 1 

OT + 1 


^0 • 


This proves the formula in the proposition. 

Remark 2.12. We note that, if we use instead the algorithm 


t^n+l.m — (ta 1 ) * 1 dn,m)i 

we obtain Bernoulli numbers with the other convention ( B\ = ^). 
Exercise 2.13. Prove the following. 

(1) {"J = 0 when mn < 0. 


\ m rn\a 0 , m 


□ 
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(2) {I,!,} = (777- 1)! form = 1,2,3,.... 



Exercise 2.14. Formulate the statements corresponding to the previous exercise for 
the Stirling numbers of the first kind, and prove them. 

Exercise 2.15. Prove the formula e l<lg(l+ ' 1 — 1 = t for the composition of formal 
power series by using Proposition 2.6 (9). 

Exercise 2.16. Prove the following formula for the sum of powers: 



Exercise 2.17. Compute the first several B n using the formula in Theorem 2.8. 

Exercise 2.18. In the Akiyama-Tanigawa triangle (on p. 37), what are the numbers 
next to the leftmost entries? 


Chapter 3 

Theorem of Clausen and von Staudt, 
and Rummer’s Congruence 


3.1 Theorem of Clausen and von Staudt 


The denominators of the Bernoulli numbers can be completely determined. This is 
due to Clausen 1 [26] and von Staudt 2 [96]. More precisely, the “fractional part” of 
B n is given by the following theorem. This result gives a foundation for studying 
p- adic properties of the Bernoulli numbers. It also plays a fundamental role in the 
theory of /t-adic modular forms through the Eisenstein 3 series [82], 

Theorem 3.1. For n = 1 and for any even integer n > 2, B„ can be written as 


B 


n 


E 

p '.prime 
p-l\n 


h C„ 

P 


(C„ is an integer). 


Here, the sum runs over all the prime numbers p such that p — 1 divides n. ( For 
integers a and b, the symbol a \ b means that a divides b, and a \ b means that a 
does not divide b. ) 

Proof. The statement is clearly true for n = 1. Let n be an even integer greater 
than or equal to 2. Since the numerator of each term (— l)" ! m!{”}/(m + 1) in the 
right-hand side of the formula in Theorem 2.8 is an integer, the only prime numbers 


'Thomas Clausen (bom on January 16, 1801 in Snogebaek, Denmark — died on May 23, 1885 in 
Doipat, Russia (now Tartu, Estonia)). 

2 Karl Georg Christian von Staudt (bom on January 24, 1798 in Imperial Free City of Rothenburg 
(now Rothenburg ob der Tauber, Germany) — died on June 1, 1867 in Erlangen, Bavaria (now 
Germany). 

3 Ferdinand Gotthold Max Eisenstein (bom on April 16, 1823 in Berlin, Germany — died on October 
11, 1852 in Berlin, Germany). 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_3, © Springer Japan 2014 
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that contribute to the denominator of B„ are divisors of m + 1 . It follows from this 
that a prime number p such that p > n + 1 cannot be a divisor of the denominator 
of B„. 

First suppose that m + 1 is a composite number, and let m + 1 = ab, 1 < a,b < 
m. If a ^ b, then ab divides ml, and thus (— 1 ) m m!{^}/(m + 1) is an integer. 
If a = b and 2 a < m, then a and 2 a divide nil, and thus a 2 = m + 1 divides ml. 
This implies that (— 1 )'"m!{^}/(m + 1) is an integer. If a = h and 2 a > m, then 
2a > m + 1, and thus m + 1 = a 2 > 2a > m + 1. This implies a 2 = 2a, or a = 2. 
In this case m = 3, and this term appears when n > 3. In this case (n even), from 
Proposition 2.6 (6) on p. 30 we have 


(— 1 ) m m \ 




= 0 - 3 + 3 • 2” - 3" 


= 1 — (—1)" = 0 mod 4. 


(For integers a,b and a natural number 4 n, if n \ (a — b), then we say that a is 
congruent to b modulo n, and we write a = b mod n. If n \ (a — b), then we 
write a ^ b mod n.) This shows that (— l) m w?!{ ; " ; }/(;jt + 1 ) is an integer. We have 
thus shown that if m + 1 is a composite number, (— l) m w!{" }/(m + 1) is an integer. 

Next, suppose that m + 1 equals a prime number p. Since p — 1 < n, it follows 
from Proposition 2.6 (6) that 


(— l) m w?! 




l n 


p - 1 

= yr mod p (since = (— 1) ; mod p) 

1=0 


— 1 mod p if p — 1 divides n, 

0 mod p if p — 1 does not divide n . 


(3.1) 


The last congruence can be seen as follows. If p — 1 divides n, then the fact that 
^2f =0 l n = p — 1 = — 1 mod p follows from Fermat’s 5 Little Theorem; i.e., 
l p ~ l = 1 mod p if l and p are relatively prime. If p — 1 does not divide n, 
then choose c such that c" ^ 1 mod p (for example, a primitive root mod p ). 
When / runs from 1 through p — 1, cl mod p also runs from the classes of 1 
through p — 1, and thus we have ^f =0 (< ?/)" = ^f=o Therefore, we have 


throughout the book, the term “natural number” means a positive integer. 

5 Pierre de Fermat (bom on August 17, 1601 in Beaumont-de-Lomagne, France — died on January 
12, 1665 in Castres, France). 


3.2 Kummer’s Congruence 


43 


(c n — 1) J2i=o l" = 0 mod P > an d X!f=o = 0 mod P- We thus see that if 
m + 1 = p, (— 1 ) "' /7 ? ! { } / (/m + 1) is an integer if p — 1 does not divide ». If, on 
the other hand, p — 1 divides //. then (— 1 )"'/n ! J'^ J / (m + 1 ) is not an integer but the 
above congruence shows that it becomes an integer if we add j to it. This completes 
the proof. □ 

The theorem of Clausen and von Staudt completely describes the denominators 
of Bernoulli numbers. Then how about the numerators? Not much is known about 
this. Here we briefly explain the notion of (ir)regular primes, which are related to 
the numerators of Bernoulli numbers. 

A prime p is said to be regular if p does not divide any of the numerators of the 
Bernoulli numbers B 2 . B 4 , ... . B p - 3 . Otherwise p is called irregular. According 
to Kummer, p is regular if and only if the class number of the /;th cyclotomic field 
m P ) is not divisible by p (cf. Washington [100, §5.3]). It is well known that this 
work of Kummer originated from his attempt to solve Fermat’s last theorem. For 
this topic, see [31] or [79]. 

For instance, there are only three irregular primes less than 100: 37, 59, 67. 
Kummer determined (by hand!) all irregular primes less than or equal to 163, and 
moreover determined the indices of B„ whose numerators are divisible by a given 
irregular prime. The next few irregular primes after 67 are 101, 103, 131, 149, 157, 
and p=\51 is the first prime which divides two of the numerators of 

B 2 .B 4 B p - 3 (see the table in [100] for irregular primes p less than 4,001 

and indices of Bernoulli numbers divisible by p). 


3.2 Kummer’s Congruence 

The goal of this section is to prove the following theorem, which plays a crucial role 
in the arithmetic of cyclotomic fields and the theory of p- adic L-functions. For a 
prime number p, we denote by Z (pl the ring of rational numbers whose denominator 
is prime to p. Using such a ring, it is easy to describe how many times p divides the 
numerator of a rational number, or decide whether or not two rational numbers are 
congruent modulo a power of p. The set of invertible elements, , of the ring /( P ) 
is the set of rational numbers whose numerator and denominator are both relatively 
prime to p. For two elements x and y, we say that the congruence relation j t = y 
mod p a holds if the numerator of x — y is divisible by p a . 

Theorem 3.2. Let p be an odd prime. 

(1) Suppose n is a positive even integer not divisible by p — 1. Then we have 


44 


3 Theorem of Clausen and von Staudt, and Kummer’s Congruence 


(2) Let a > 1 be a natural number and m,n be two even integers that satisfy 
a + 1 < m < n. Suppose that m and n are not divisible by p — 1, and that 
n = m mod (p — 1 )p a ~ l . Then we have 

Bn B m 

— = — mod p . 
n m 


If we remove the condition m > a + 1, then the above congruence relation 
does not hold in general (except for the case a = 1, where m > a + 1 gives no 
restriction) and a certain correction is necessary (see Theorem 1 1.6 on p. 198 for a 
more precise statement). This corrected version is what is known today as Kummer’s 
congruence. However, the version that Kummer 6 himself gave was different (except 
when a = l) from these theorems we are presenting (see [64]). The formulation of 
Kummer’s congruence seems to have been changed since a natural interpretation 
of the congruence by p-adic integrals was discovered. Kummer’s own proof is 
elementary, and we can derive the above congruence relation using his method. The 
complete form of the congruence will be proved using a p-adic integral in Chap. 11. 
We first prove the following lemma. 

Lemma 3.3. Let p be an odd prime number, and let < p(t) e Z( P ) [[?]], r, s e Z ( p y 
Develop the formal power series obtained by substituting e rt — e st = ( r — s)t + 
~ s - 1 2 + ■ ■ ■ for t in tp(t) : 


<p(e rt ^ e st ) = Y J A i 

77=0 


t 


77 


n\ 


Then we have A„ € 'L(p), and for any m,a ( m > a > 1), the congruence 
Afn+tp— \)p a ~ i = Am mod p 


holds. 

Proof. Write (p(t ) = a kt k ■ Then, we have 


cp(e rt -e st ) = Y^,a k {e rt - e st ) k 
k = 0 


k = 0 h=0 

oo k 

= EEh)' 


1 h. 


e rt(k-h) e sth 


k=0 h=0 


ak W e 


( r{k—h)+sh)t 


6 Emst Eduard Kummer (bom on January 29, 1810 in Sorau, Brandenburg, Prussia (now 
Germany) — died on May 14, 1893 in Berlin, Germany). 


3.2 Kummer’s Congruence 


45 


We obtain A„, by taking the derivative in times with respect to t, and putting t = 0: 

oo k 

A m = J2J2 ( - I)l ' ak 

k=0 /,= 0 

Since ( e rt — e st ) k begins with the term t k , the sum with respect to h in the above 
formula equals 0 if k > m. Thus, the above sum with respect to A: is a finite sum. 
Replacing m by m + (p — 1 )p a ~ l , and subtracting A m , we have 

Ajn+(p— \)p a ~ 1 Am 

oo k 

= ££<-«*« 

k=0h=0 

If r{k — h) + sh is divisible by p , then since m > a, ( r(k — h) + sh) m is divisible 
by p a . If r(k — h) + sh is not divisible by p, then ( r(k — h ) + sh) < ' p ~ x ' ) p a 1 _ ] 
is divisible by p a since the order of (Z / p a Z) x is ( p — 1 ) p“~ l . In either case the 
right-hand side is divisible by p a , and thus 

A m+(p _ l)pa - 1 = A m mod p a 


I (r(k -h) + sh) m ((r(k - h) + sli) (p - l)p “ ' - 1). 


\(r(k-h) + sh)" 


holds. □ 

Proof ( of Theorem 3.2). For a natural number c ^ 1 relatively prime to p. consider 


1 


(pit) = - 


t (l+f) e -l 


Since c e Z^, the expression 


(l + f) c -l c-1 , 

=t + ——t- + 

c 2 


belongs to Z( p )[[f]], and it is invertible in Z ( P )((f )) (at the end of Sect. 1.3 on p. 20). 
From 


c 1 c — 1 c 2 — 1 

= — — “|- 1 -j“ 

(1+0 C -1 t 2 12 


we have cp{t) € Z( p ) [[?]]. Since 


t te‘ t t" 

= — t = 1 — — T / Bn — , 

'-1 e‘-l 2 n\ 

n=2 
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we have 


cpie'-l) 


1 

e> - 1 

1 — c 
2 



t ct \ 



Bn t n ~ l \ 
n (« — !)!/ 


Thus it follows from the lemma (r = I . ,v = 0) that ( 1 — c") — e Z (/ ,) , and if n = m 
mod (p — 1) p a ~ l , then 


B„ B m 

(1 -c n )— = (1 -c m )— mod p a . 
n m 

Here, we choose c to be a primitive root modulo p (c mod p is a generator of 
(Z/ p Z) x ). If n is not divisible by p — 1, then m is not divisible by p — 1, and 
we have 1 — c", 1 — c m e (Z/ p a Z) x with 1 — c" = 1 — c m mod p a . This gives 
^ e Z (/) ) and the congruence in the theorem. □ 


3.3 Short Biographies of Clausen, von Staudt and Kummer 

Thomas Clausen, an astronomer and a mathematician, was born on J anuary 16, 1801 
in Snogbaek, Denmark. He was educated by a local pastor Georg Holst before he 
got a position as an assistant at the Altona observatory, where Schumacher 7 8 * was the 
head. He then moved to the Joseph von Utzschneider Optical Institute in Munich as 
the successor to Fraunhofer, who is famous for “Fraunhofer lines” in physics and 
optics. In 1842, he was appointed observer at the Dorpat observatory in Russia, and 
there he remained until his retirement in 1872. The Copenhagen Academy awarded 
Clausen a prize for his work on the determination of the orbit of the 1770 comet 
(published in Astronomische Nachrichten in 1842). Bessel 10 highly praised this 


7 The description in this section is based on Biermann [17, 18], Noether [74], Hensel [40], and 
Lampe [65], 

s Heinrich Christian Schumacher (bom on September 3, 1780 in Bramstedt, Germany — died in 
1850). His first degree was in law, and after that he studied astronomy under Gauss and became 
an astronomer. He launched the journal Astronomische Nachrichten, which is famous for the 
contributions of Abel and Jacobi on the theory of elliptic functions, as the managing editor. 
Incidentally, Abel once met Clausen during his stay in Hamburg (cf. [19]). 

^Joseph von Fraunhofer (born on March 6, 1787 in Straubing, Germany — died on June 7, 1826 in 
Munich, Germany). 

10 Friedrich Wilhelm Bessel (bom on July 22, 1784 in Minden, Westphalia (now Germany) — died 
on March 17, 1846 in Konigsberg, Pmssia (now Kaliningrad, Russia)), an astronomer, who is 
famous for the Bessel function. 
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work. In 1854 he received through Gauss 11 a corresponding membership from the 
Gottingen Academy. In 1 856 he received the same class of membership from the St. 
Petersburg Academy. Clausen published approximately 150 papers on a multitude 
of subjects. Still today one sees his name in such terms as Clausen’s identity 12 in 
the theory of hypergeometric series [23], and Clausen’s function (a variant of the 
dilogarithm function) [24], Also, he computed n up to 250 decimal places, 13 and 
found the prime factorization of the sixth Fermat number 2 2 + l. 14 The actual 
factorization is 2 26 + 1 = 274177 ■ 67280421310721 and, as Clausen commented, 
the second factor of this is the largest prime number known to that day (letter to 
Gauss on January 1, 1855). He wrote to his friend and astronomer C. A. F. Peters 
in Konigsberg that he had found a new method of factoring a large number, and as 
an example he wrote (10 17 — l)/9 = 2071723 ■ 5363222357, which was not known 
before and had taken some interest in connection to the period length of the periodic 
decimal of 1 / p. However, he never published his “new method” and we have no 
idea what kind of method he had found. 

Clausen never married, and died on May 23, 1885 in Dorpat, now Tartu in 
Estonia. 

Karl Georg Christian von Staudt was born on January 24, 1798 in Rothenburg- 
ob-der-Tauber, Germany, as a son of a wealthy family. From 1818 to 1822, von 
Staudt studied mathematics at Gottingen under Gauss. He was also interested in 
astronomy (at that time Gauss was also the head of the observatory), and, as early 
as in 1820, he computed the positions of Mars and the asteroid Pallas, and in 1821 
he computed the orbit of a comet. The accuracy of the computation was highly 
praised (described as an “outstanding proficiency”) by his master Gauss. He had 
been holding positions as a high-school teacher at Wurzburg and Niirnberg until he 
was appointed professor at Erlangen in 1 835, where he remained throughout his life. 

His main contribution in mathematics is in Geometrie derLage (1847, Geometry 
of Position), which is the title of his book and is now called projective geometry. As 
for Bernoulli numbers, he discovered not only the theorem introduced in Sect. 3.1 
[96, found independently of Clausen], but also, according to Noether 15 [74], several 
other results including the formula in Theorem 2.8 (p. 35) and even Rummer’s 


"Johann Carl Friedrich Gauss (bom on April 30, 1777 in Brunswick, Duchy of Brunswick (now 
Germany) — died on February 23, 1855 in Gottingen, Hanover (now Germany)). 


n iFi(a, fi,a + f + \',x) 2 = }F 2 


( 2a,2p, a +p . 

\^+/3+l,2a+2/?’ A 


13 It was correct up to 248 decimal places. This was the world record from 1847 to 1853. For a 
history of computation of n, see for example [ 13]. 

14 Fermat conjectured that all the numbers of the form 2 2 ” + 1 (which are called Fermat numbers) 
are prime. This is tme for n < 4, but Euler showed in 1732 that this is not the case for n = 5 by 
giving the factorization. As of November 2013, it is known that every Fermat number 2 2 + 1 with 
5 < n < 32 is not prime. (Complete factorization is known up to n = 11.) 

15 Max Noether (bom on September 24. 1844 in Mannheim, Germany — died on December 13, 
1921 in Erlangen, Germany). 
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congruence (modulo prime number case). He presented these results in two papers 16 
submitted to Erlangen University on the occasion of becoming a member of the 
senate and the faculty. These results, however, caught no attention because, as 
Noether put it, von Staudt was so modest and did not care about the circulation of 
the papers (cf. [88] for the contents of these two papers). He also published a paper 
[97] on a proof of the "fundamental theorem on algebra” of Gauss (Clausen also 
wrote a paper on the same subject [25]). He wrote up a paper one week before his 
death, while suffering from asthma. He died on June 1, 1867. Noether quotes from 
the funeral address the passage “his endurance, prepared for the lifelong effort, not 
for months or years, to pursue a distinct goal”, which he thinks describes well the 
character of von Staudt. 

Ernst Eduard Kummer was born on January 29, 1810 in Sorau, Germany 
(now Zary, Poland). Kummer entered the Gymnasium in Sorau in 1819 and 
the University of Halle in 1828. He soon gave up his original study, Protestant 
theology, under the influence of the mathematics professor Scherk 17 and applied 
himself to mathematics. In 1831 he received a prize for his essay on the question 
posed by Scherk: “De cosinuum et sinuum potestatibus secundum cosinus et sinus 
arcuum multiplicium evolvendis”. 18 Kummer taught from 1832 until 1842 at the 
Gymnasium in Liegnitz (now Legnia, Poland), mainly mathematics and physics. 
His students during this period included Kronecker and Joachimsthal, 19 both of 
whom became interested in mathematics through Kummer’s encouragement and 
stimulation. In particular, the influence of Kummer upon Kronecker was enormous 
and they became lifelong friends. One can see the fragments of their friendship in 
their correspondence collected in Kummer’s Werke. 

The subjects of Kummer’s mathematical study are rather clearly divided accord- 
ing to the periods. The first is in function theory, the second in number theory 
and the third in geometry. His period of Gymnasium teaching coincided with his 
creative period in function theory. The most important was the famous paper on 
the hypergeometric series [63]. Kummer sent this paper to Jacobi, which led to his 
scientific connection with Jacobi and with Dirichlet. 20 In 1839, through Dirichlet’s 
proposal, Kummer became a member of the Berlin Academy of Sciences, and in 
1840, he married Ottilie Mendelssohn, a cousin of Dirichlet’s wife. 21 (She died in 
1848 and later Kummer married Bertha Cauer.) On the recommendation of Dirichlet 
and Jacobi, Kummer was appointed full professor at the University of Breslau (now 


16 Under the same title “De numeris Bemoullianis”, Erlangen, 1845. 

17 Heinrich Ferdinand Scherk (bom on October 27, 1798 in Poznan, Poland — died on October 4, 
1885 in Bremen, Germany), he also wrote a paper on Bernoulli numbers [81], 

18 “On expansions of powers of cosine and sine by cosine and sine with their arguments multiplied.” 
'^Ferdinand Joachimsthal (bom on March 9, 1818 in Goldberg, Prussian Silesia (now Zlotoryja, 
Poland) — died on April 5, 1861 in Breslau, Germany (now Wroclaw, Poland)). 

20 Johann Peter Gustav Lejeune Dirichlet (bom on February 13, 1805 in Diiren, French Empire 
(now Germany) — died on May 5, 1859 in Gottingen, Hanover (now Germany)). 

21 Dirichlet’s wife Rebecca Mendelssohn is a younger sister of the composer Felix Mendelssohn. 
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Wroclaw, Poland) in 1842. His second period of research, dominated especially by 
number theory and lasting approximately 20 years, began about this time. 

In 1855, Gauss died and Dirichlet left Berlin to succeed him at Gottingen. In the 
same year Kummer was appointed professor at Berlin to succeed Dirichlet. 

In his third period, starting from around 1860, he studied geometry, his most 
famous work being the study of the quartic surface known today as the Kummer 
surface. (He also published papers on the arithmetic of cyclotomic fields until the 
mid 1870s.) 

Kummer’s popularity as a professor, based not only on the clarity of his lectures 
but on his charm and sense of humor as well, attracted a great number of students. 
Kummer was first Gutachter for 39 dissertations at Berlin. Of his doctoral students, 
seventeen later became university teachers, several of them famous mathematicians: 
Fuchs, 22 du Bois-Reymond, 2 ’ Gordan, 24 Bach matin, 2 ’ Schwarz 26 (Kummer’s son 
in law). Cantor, 27 Schonflies. 28 Kummer was also second Gutachter for thirty 
dissertations at Berlin. 

He retired in 1883 and was succeeded by Fuchs. Kummer died on May 14 in 
1893, 10 years after his retirement. 

Exercise 3.4. Compute the integer C„ in Theorem 3.1 for several n. What is the 
first n such that C„ ^ 1? (Note that lim„-H.oo \ C„\ = oo because C„ is very close 
to B„ .) 

Exercise 3.5. Suppose n is even and n > 4. Prove the following. 

(1) 2 B„ e Z(2). 

(2) The congruence 2 B n = 1 mod 4 holds. Hint: Use the formula in Theorem 2.8 
and look at each term on the right modulo 4. 

Exercise 3.6. Use Theorems 3.1 and 3.2 (1) to prove that the tangent number T„ at 
the end of Chap. 1 is an integer. 


22 Lazarus Immanuel Fuchs (born on May 5, 1 833 in Moschin, Prussia (now Poznan, Poland ) — died 
on April 26, 1902 in Berlin, Germany). 

23 Paul David Gustav du Bois-Reymond (bom on December 2, 1831 in Berlin, Germany — died on 
April 7, 1889 in Freiburg, Germany). 

24 Paul Albert Gordan (bom on April 27, 1837 in Breslau, Germany (now Wroclaw, Poland) — died 
on December 21, 1912 in Erlangen, Germany). 

25 Paul Gustav Heinrich Bachmann (bom on June 22, 1837 in Berlin, Germany — died on March 
31, 1920 in Weimar, Germany). 

26 Karl Herman Amandus Schwarz (bom on January 25, 1843 in Hermsdorf, Silesia (now 
Poland) — died on November 30, 1921 in Berlin, Germany). 

27 Georg Ferdinand Ludwig Philipp Cantor (bom on March 3, 1845 in St. Petersburg, Russia — died 
on January 6, 1918 in Halle, Germany). 

28 Arthur Moritz Schonflies (bom on April 17, 1853 in Landsberg an der Warthe, Germany (now 
Gorzow, Poland) — died on May 27, 1928 in Frankfurt am Main, Germany). 


Chapter 4 

Generalized Bernoulli Numbers 


In this chapter we introduce generalized Bernoulli numbers and Bernoulli 
polynomials. Generalized Bernoulli numbers are Bernoulli numbers twisted by 
a Dirichlet character, which we define at the beginning of the first section. Bernoulli 
polynomials are generalizations of Bernoulli numbers with an indeterminate. These 
two generalizations are related, and they will appear in various places in the 
following chapters. 


4.1 Dirichlet Characters 


Let us define a Dirichlet character as a map from the set of integers Z to the set of 
complex numbers C. 

Definition 4.1. Let / be a natural number. A function / : Z — > C is called a 
Dirichlet character modulo / if it satisfies the following three conditions: 

(i) / is multiplicative; i.e. x( a b ) = X( a )x(b) f° r an Y a > b € Z. 

(ii) The value of /(a) only depends on a mod / ; i.e, /(r/ + bf) = /(a) for any 
a, b e Z. 

(iii) If ( a , /) = 1, then /(a) ^ 0, and if ( a , /) ^ 1, then /(a) = 0. Here, (a, f) 
stands for the greatest common divisor of a and /. 

In particular, the function Xo defined by 


1 («,/)= 1 

0 


satisfies the three conditions. We call this the trivial character (modulo /). 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_4, © Springer Japan 2014 
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For a natural number /, let (p(f) be the number of integers from 1 to / which 
are relatively prime to /. This is called the Euler (totient) function. If a is an integer 
relatively prime to / , then we have 

q'K/) = i mo( j y 

This is known as Euler’s theorem in elementary number theory. For a proof see for 
example Ireland and Rosen [50], 

A Dirichlet character / modulo / satisfies the following properties. 

Lemma 4.2. (1) If a = 1 mod f, then y(a) = 1. 

(2) If (a, f) = 1, then x(a)^ f) = 1. 

( 3 ) 


cp(f) if X = Xo ( trivial character), 

0 ifX ± Xo- 

The summation is taken over all representatives a modulo f . 

Proof (1) Putting b = 1 in condition (i), we have y(a) = x( a )x(^)- If a = 
1 mod /, then we have y{a) 7 ^ 0 from (iii). Thus, we have /(l) = 1, which in 
turn implies y(a) = 1 from (ii). 

(2) Suppose (a, f) = 1. From Euler’s theorem we have = 1 mod /. Thus, 

X(af (n = x(a vU) ) = *( 1 ) = 1 . 

(3) Since 1 p(f) is the number of a mod / such that (a, /) = 1, the assertion 
holds if x is the trivial character. If / is not the trivial character, then there 
exists b such that x(b) f 0, 1. If we put S = mod f /(a), then we have 
X(b)S = mod / x(b)x(a) = mod f x(ba). When a runs through all 
the representatives mod /, so does ba since (b, f) = 1. Thus, the sum on the 
rightequals S. This implies (l — x(b))S = 0andhenceS = 0, as 1 — x(b) 7 ^ 0. 

□ 

We note that a Dirichlet character defines naturally by definition a character of 
the multiplicative group (Z//Z) x . 

Now suppose f | /, and let y' be a Dirichlet character modulo f. If y satisfies 
y(a) = y'(a) if (a, f) = 1 , 

then we say that y is defined mod / '. For a Dirichlet character y modulo /, the 
smallest f such that y is defined mod /' is called the conductor of /, and is 
written as f x . By definition, f x is a divisor of /. 

Definition 4.3 (primitive character). A Dirichlet character y modulo / is called 
primitive if f = f x - 


X( a ) = 
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Let x be aDirichlet character modulo /. Since /(— l) 2 = /((— l) 2 ) = /(l) = 1, 
we see that /(— 1) = ±1. 

A Dirichlet character with /(— 1) = 1 is called an even character. A character 
with x(—l) = —1 is called an odd character. Given a Dirichlet character /, we 
define the complex conjugate character / by 


X(a) = /(a). 


where /(a) stands for the complex conjugate of /(a). We have f x = fj. 

Example 4.4. Define Xj ■ Z — * C ( / = 0, 1, 2, 3) as follows. 

If (c/,2) ^ 1 (i.e. 2| a), then /y(c/) = 0 (J = 0, 1,2,3). When a is odd, for each 
a mod 8, x( a ) is given in the following table: 


XV 

1 mod 8 

3 mod 8 

5 mod 8 

7 mod 8 

Xo («) 

1 

1 

1 

1 

Xi( a ) 

1 

1 

-1 

-1 

X 2 (a) 

1 

-1 

-1 

1 

/ 3 (a) 

1 

-1 

1 

-1 


Then, X j ( j = 0, 1 , 2, 3) are all the Dirichlet characters modulo 8. The character xo 
is the trivial character with f Xo = 1 and xi, Xi are primitive characters modulo 8, 
but X 3 (= X\l'>) is not primitive, with / /3 = 4. 


4.2 Generalized Bernoulli Numbers 


Given a Dirichlet character / modulo /, the generalized Bernoulli numbers B n / 
are defined by using the generating function 

v- X(a)te at _ ^ t" 

2 ^ ef t_ l ~ 2 ^ "».*„!• 
a= 1 n=0 


In the following we assume / > 1 and x is not the trivial character unless otherwise 
stated. 

Since we have the expansion 


te at 1 / a 

e ft -l = f + V7 


t 


+ 


( 
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it follows from Lemma 4.2 (3) that 

Bo, x = 0, 

/ 


1 

B u = 7 2^x (a "> a ' 

' a= 1 


Bi ,x = 7 E x^ 2 - X! 

* a= 1 a=l 

1 f 3 / f f 

b xx = 7 X! - 9 E + 7 X! 

^ a= 1 a = l a=l 

A similar formula for /?„,/ is given as 


B ".x = f "- 1 E x^ B » («//) • 


(4.1) 


where B n (x) is the Bernoulli polynomial, which will be defined in the next section. 
(See Remark 4.10 (2) in the next section.) 

We have seen that all the Bernoulli numbers with odd indices greater than 1 are 0. 
For generalized Bernoulli numbers we have the following. 

Proposition 4.5. Let / be a non-trivial character. Then, for any n satisfying 
(— 1)' I_1 = /(— 1), we have B„ x = 0. In other words, if y A an even character, 
then B n/ with odd indices n are 0; if y is an odd character, then B n / with even 
indices n are 0. 

Proof. Since y is non-trivial, we can rewrite the generating function as follows: 


E y{a)te at = E y(f -a)te (f a)t 
g/r _ 1 2—/ e fi - 1 

a= 1 <3 = 1 


/-i 

= /(-!) E 


a= 1 


X(a)te at 
1 - e~f* 


/-i 

= /(-!) E 


a= 1 


X(a)(-t)e a( - 

e n-n - l 


It follows immediately from this that the generating function is an even function if 
/(— 1) = 1, and an odd function if /(— 1) = — 1. □ 
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Remark 4.6. For any n satisfying (— 1)" = /(— I ), it is known that B n/ is non-zero. 
This can be seen as follows. From Theorem 9.6 on p. 148, B„ x is non-zero if and 
only if L(«,/) is non-zero. We can prove L(n , /) ^ 0 using the Euler product 
formula for the L-function with n > 1 . The fact that L(l, /) ^ 0, i.e., B\ x / 0 for 
all X- is essentially equivalent to Dirichlet’s theorem on prime numbers in arithmetic 
progressions. Dirichlet proved his theorem using the class number formula. See 
Dirichlet [29] or Serre [83, Chap. 6]. 


4.3 Bernoulli Polynomials 

Bernoulli polynomials are often defined by means of a generating function. Here, 
we define them in a different manner. 1 Their generating function will be given in 
Proposition 4.9 (5). 

First, define a Q-linear map / : Q[.v] — > Q[x] from the polynomial ring Q[x] to 
itself by 



/(>') dy (/(x) € Q[x]) 


(difference of the primitive function). Since we see immediately that 


I(x n ) = — j-j- ((x + 1)" +1 -x" +1 ) = x" + (lower terms). 


the matrix representing / with respect to the basis l,x,x 2 ,x 3 , ... of Q[x] is an 
upper triangular matrix with the diagonal components 1 . In particular, I is invertible. 
We thus define the Bernoulli polynomials as follows. 

Definition 4.7. Define the Bernoulli polynomials B lt (x ) (n = 0, 1, 2, . . . ) by 


B n (x) := r\x n ). 

In other words, B„ (x) is the unique polynomial satisfying 

rx+l 

I(B n (x))= / B„(y) dy = x" . 

J X 

Because of the form of /(x), B„(x) is a monic polynomial (a polynomial in which 
the coefficient of the highest-degree term is 1 ) of degree n with coefficients in Q. 


1 This was suggested by Don Zagier. 
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Example 4.8. 

1 , 1 ,3,1 

B 0 (x) = 1, B i(x) = x — B 2 (x) = x —X+-, Bi(x) = x — -x + -x, 

2 6 2 2 

i ?4 (x) = x 4 — 2x 3 + x 2 , B 5 (x ) = x 5 — -x 4 + -x 3 X. 

30 2 3 6 

Proposition 4.9. (1) B n { 1) = B„ ( n > 0). Also, if n 1 , then B n { 0) = 
B„{ 1) = B„. 

(2) B n (x + 1) — B„(x ) = (n > 0). 

(3) B' n (x) = nB„-\ (x) (n > 1). 

(4) B n (\ — x) = (—1)" B„(x) (n > 0). 

(5 ) (Generating function) 


J5L t” te x1 

J2 7- 

L — ^ n\ e l — 1 

/7 — 0 

(6) (Explicit formula with Bernoulli numbers) 




(4.2) 


(7) 


ifc-t 

E = k l ~ n B n (kx) (, k > 1). 


Proof. (2) It is obtained by differentiating the 

x" with respect to x. 

(3) It follows from (2) that 


defining equation 


rx+l 

L j 


B„(y)dy = 



B’n(y) 


dy 


- ( B n (x + 1) - B n (x)) = x" 1 . 
n 


Thus, it follows from the definition that 


Kb) 

n 


= B n - i(x), 


or 


5'(x) = n B n ~\ (x). 
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(4) Since we have 


[ B„(l-y)dy = -f B n (y)dy= f B n (y) dy = {-xf = (- 1)V', 
Jx J 1-x J-X 


it follows from the definition that (— 1 ) n B n (\ — x) = B n (x). 

(5) Applying I to the coefficient of each term of the left-hand side of (4.2), we have 
J2n= 0 x n t" / n \ = e xt as formal power series in t. On the other hand, regarding 
the right-hand side of (4.2) as a formal power series, and applying / term by 
term, we have I(te xt /{e 1 — 1)) = tl(e xt )/(e r — 1) because of the linearity. 
Furthermore, since we have 



we see I(te xt / (e 1 — 1)) = e xt . The injectivity of I implies that both sides are 
equal. 

(1) The first half of (1) can be seen by putting x = 1 in (5) and by using the 
generating function of Bernoulli numbers (Theorem 1.12 on p. 20). Putting 
x = 0 in (2), we see that B n ( 1 ) = B„( 0) for /? / I . The second half of the 
assertion follows from this and the first half. 

(6) We compute the product t / (e r — 1 ) x e xt on the right-hand side of the generating 
function (4.2). Since 



we have 




This proves the formula (6). 

(7) We have 




k - 1 „*+!+! 
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ri+ l 

= k \ B„(y)dy 

•'f 

= k (^y =k l-n x n 

From this we have 

Bn 

1=0 ' ' 

The formula is obtained by letting x — > kx. □ 

Remark 4.10. (1) As we mentioned in Remark 1.3 on p. 10, it is easy to prove the 
formula for the sum of powers ((1.1) on p. 1) using the Bernoulli polynomials. 
Replacing n by k + 1 in Proposition 4.9 (2), we have 

x k = — - (B k+ i(x + 1) - B k+l (x)) . 
k + 1 

Putting x = \,2, ... ,n, and adding them all up, we have 

" i 

J2 lk = jtTT i(« + i)-**+i(D)- 


We then use Proposition 4.9 (1), (4), (6) to obtain 
B k +i(n + 1) - B k+ i(l) = (— 1 ) A+I B k+ \ (—n) - B k+l 


= (-D 


jfc+t 



Bj(-n) k+1 ~j -B k+ i 



Bjn 


k+i-j 


Since = (J) the formula (1.1) follows. 

(2) The formula (4.1) which expresses generalized Bernoulli numbers in terms of 
Bernoulli polynomials is obtained as follows. In (4.2) replace x by a/ f and t 
by ft. Then we obtain 


te ,a 
ef‘ - 1 


1 

7 


>7 = 0 


(ft) 

n\ 
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Thus, we have 


d 1 x( a ) teta t \ ^ d / 

E B »-X ffi = E 77T3T = E ^ (fl) 7 E 


n = 0 


77 ! 


oo / 


= EE X(a)B n ( a/f ) 


f n ~' t 


n= 0 a = 1 


77 ! 


Comparing coefficients, we obtain 


B,,. x = /" _1 E *(«)*»(«//)■ 


a= 1 


Next we take up a beautiful formula involving Bernoulli polynomials. For any 
real number x, we denote by [x] the greatest integer less than or equal to x. This is 
sometimes called the Gauss symbol . 

Theorem 4.11. Let k be a natural number. The formula 


B k (x - [x]) = - 


k\ 

(2 7ti) k 


E 

n€ Z 

77 7^0 


glrcinx 



(4.3) 


holds for all real numbers x ifk > 2; it holds for all real numbers x $ Z, if k = 1. 
Here, the sum is taken for all integers different from 0. Ifk = 1, the infinite sum on 
the right-hand side should be understood as 


N 


lim 

N—>oo 



77^0 


glninx 

77 


Ifk > 2, then the right-hand side of (4.3) converges absolutely and uniformly with 
respect to x. 

Proof Let 0 < x < 1 . Decompose the function 


m = 


e x i 

et- 1 


into partial fractions as a meromorphic function in £. This f(f) has poles at 
£ = 2nin (n e Z) and all of them are of order 1. The residue at £ = 2k in is 
given by 


lim (£ — 2nin)f(^) = lim t ■ 

£— *- 27 TZ 77 t — >-0 


gX (t -\- 2 n i n) 
e t+ 2 nin _ | 


g2ninx 
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Let N be a sufficiently large natural number, and put R = 2n(N + l/2). Let C,v be a 
square path passing through four corner points R + iR, —R + iR, — R—iR , R — iR 
in this order in the £ -plane. If t is a point inside Cn such that t ^ 2: tin (n e Z), 
then it follows from the residue theorem that 



Therefore, if 0 < \t \ < 2n, we have 


fit) = 


- + lim 'Y' 

t N-+oc t— 1 
n=—N 
n t^O 


glninx 

t — 2 n i n 


1 

t 


lim 


N 

E 


0 2 ninx 


2itin (1 — ~-^) 

— fj \ 2mn / 


n t^O 


l 

t 


N 


lim 

N-+oo 


E 


n=—N 


n t^O 


g2 ninx 

2nin 



Since the sum 


E E 

n=-o o k=2 
n t^O 


^2 ninx 

(2jti n) k 


converges absolutely, the above formula becomes 


fit) = lim Y' 

t N—*oo ' 


n 2 ni. 


oo oo 


- E E 


n=—N 
n t^O 


n=-oo k=2 
n 7^0 


g2 ninx 

rt 

(2 jrin) k 


2 ni n 
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Since the generating function (4.2) of Bernoulli polynomials can be regarded as the 
Taylor expansion of te xt / (<?' — 1) at t = 0 for each real number x, we have 

00 t k ~ l 

= Y B k {x) — . 
k=0 


Thus, comparing the coefficients, we obtain 


N 

B\(x ) = — lim 

N->o O 

n=—N 

n^O 



glrcinx 

2 7t in 


B k (x ) = 


-*» E 

n — — oo 
77^0 


glninx 

(2 jtin) k 


(k > 2). 


Since the sum on the right-hand side converges uniformly over the closed interval 
[0, 1] if Jt > 2, and since B k (x) is continuous in the same interval, the above formula 
holds in [0, 1], Furthermore, the right-hand side is periodic with respect to x with 
period 1 . Thus, (4.3) is obtained by extending the left-hand side. □ 

From this theorem we can obtain the values of the Riemann zeta function at 
positive even integers. (The Riemann zeta function will be treated in the next 
section.) 

Corollary 4.12. Let k be an even integer greater than or equal to 2. Then we have 



77 = 1 


1 Bk 

2 k\ 


{2ni) k . 


For example, 


(4.4) 



77= 1 




77 = 1 



77=1 




77=1 


9450’ 


etc. 


Proof. It suffices to put x = 0 in the theorem. The left-hand side is equal to B k from 
Proposition 4.9 (1). Since the right-hand side is twice the sum over the positive n 
since k is even, the formula in question holds. □ 

Remark 4. 13. The formula in the theorem can be rewritten as follows. 

(1) If k is even, 


OO 

B k (x - [x]) = 2(-l ) k ' 2 - l k\ Y 

71=1 


cos 2 nnx 


(2 rtn) k 


62 


4 Generalized Bernoulli Numbers 


(2) If k is odd, 


oo 

B k {x - [*]) = 2(-l f +1)/2 k\J2 

n = 1 


sin 2itnx 
(2nn) k 


(4.5) 


Here, we assume x is not an integer if k = 1 . 

Theorem 4.11 has an interesting application. 2 Let us consider the special values 
of the following zeta function. Let a, b be integers different from 0, and let p, q, r 
be positive even integers. Define 


S{p. q , r; a, b) 


E 

am+bn^ 0 


1 

m p n q {am + bn) r 


Here, the sum runs over all integers m , n satisfying m 0, n 0, am + bn ^ 0. 
Proposition 4.14. Let p, q. r be positive integers. Then, we have 


jz~p- q ~'' S(p, q, r\a,b) e Q. 


Proof. Define / = —{am + bn). We can express S{p, q, r\ a, b) as an integral as 
follows (we use the notation e(vu) = e lnlw )\ 


S{p. q , r; a , b) 


E 

m, n , /€ Z 

in, n, I t^O, am+bn-\-l=0 


l 

m p n q l r 




m,n,l €Z 
, «, / ^0 


jf £ 


e{{am + bn + l)x) 
m p n q l r 


dx 


m,n, l €Z 
m, n, l t^O 


e{{am + + /)x) 

m p n q l r 


dx 


jf? 


m€ Z 
#0 


e{amx ) 




E 

n€Z 

H t^O 


e{bnx) 

n q 


e(/x) 

> — fl.T 

/e Z 
M0 


Here, we used the fact that the series v{antx)/m p converges uniformly over 

the closed interval [0, 1], because p. q , r > 2. Using the formula (4.3), we have 


2 This is based on a lecture of Don Zagier at Kyushu University in 1999. 
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(_1) P V (2 7T) p+q+r /* 1 

S(p,q,r;a,b) = / BJax— [ax])Ba(bx— [bx])B r {x) clx. 

p\q\r\ J 0 

Thus, the value of the integral is a rational number (recall that the Bernoulli 
polynomial is a polynomial with rational coefficients), and the assertion of the 
proposition follows. □ 

Example 4.15. (p = q = r = 2, a = b = l.) From the above formula, 

(2n) 6 f l 

5(2, 2, 2; 1, 1) = — — — / B 2 (x) 3 dx. 

8 Jo 


Integration by parts using B[(x) = 1, B' 2 (x) = 2B\(x) gives 


f B 2 (x) 3 dx = f B[(x)B 2 (x) 3 dx 
Jo Jo 

= [Bi(x)B 2 (x) 3 ~^ 0 - [ 6Bi(x) 2 B 2 (x) 2 dx 
Jo 

= B\ — 2 f [B\{x) 3 )' B 2 {x) 2 dx 

Jo 

= B\-2 [Bi(x) 3 B 2 (x) 2 ] ‘ | f (Bi(x) 5 )' B 2 (x) dx 

3 Jo 

= B 2 - \ B l + J [5i(^) 5 B 2 (x)]o - y j B\ (x) 6 dx 

= Bl-\ B2 + ^ B2 J^[B^i 
1 


Thus, we have 


2 2 • 3 3 • 5 • 7 ' 


5(2, 2, 2; 1, 1) 


2it° 


3 3 • 5 • 7’ 


Exercise 4.16. For a non-trivial character / modulo / > I, write /i 4 / and B 2 / by 
i x( a ) a> with 1 = 2, 3, 4, 5. 

Exercise 4.17. Prove that the Bernoulli polynomial B„(x) for odd n > 1 is always 
divisible by x(x — \)(x — 1). 



Chapter 5 

The Euler-Maclaurin Summation Formula 
and the Riemann Zeta Function 


Aside from the formula for the sum of powers, the most basic topics in which 
Bernoulli numbers appear is the Euler-Maclaurin 1 summation formula and the 
values of the Riemann zeta function at integer arguments. In this chapter we survey 
these topics. 


5.1 Euler-Maclaurin Summation Formula 

The Euler-Maclaurin summation formula gives a very effective tool for evaluating 
a sum of values of a function at integers. 

Theorem 5.1. Let a and b be integers satisfying a < b, and let M be a natural 
number. Suppose f(x ) is an M times continuously differentiable function 2 over 
[a. b]. Then, we have 


h 


M-\ 

(k+l)\ yJ w J K 






Here, B ^+ 1 is the Bernoulli number and Bm(x) is the Bernoulli polynomial, and 
the sum on the right-hand side is understood to be 0 if M = 1. 


'Colin Maclaurin (bom in February 1698 in Kilmodan, Cowal, Argyllshire, Scotland — died on 
June 14, 1746 in Edinburgh, Scotland). 

2 Differentiable M times and the Mth derivative is continuous. 

T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 65 

in Mathematics, DOI 10.1007/978-4-431-54919-2_5, © Springer Japan 2014 
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Remark 5.2. (1) Regard f(n) as the area of a rectangle whose base is the interval 
[n — n + 7 ] and whose height is / («). Then the sum of the first two terms 
on the right-hand side is the first approximation, so to say. The next term can 
be seen as the higher approximation in terms of the data at the end points of the 
interval. 

(2) If we use the values of the Riemann zeta function at negative integers £ (—k) = 
—B k +i/(k + 1), which will be proved later, then the formula looks like 




Proof. Let g(x) be an M times continuously differentiable function on [0. 1]. Since 
B\ (x) = x — 1/2, we have B[ (x) = 1. Thus, using integration by parts, we have 




Iterating integration by parts using B k (x) = B' k+l (x) / (k + 1) (Proposition 4.9 (3) 
on p. 56), we obtain 


j g(x)dx = X - (g(l) + g(0)) - X - [B 2 (x)g'{x)\ ‘ B 2 (x)g"(x ) dx 

= \ (f (1) + g(0)) - [5 2 (.v)g'(x)] ( 1 ) 

B 2 {x)g"' (x) dx 




Now, if k > 2, then B k ( 0) = B k ( 1) = B k (Proposition 4.9 (1)) and {-\) k B k = B k 
(Proposition 1.4 on p. 10). Thus, we have 
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/* 1 j Xt 1 n 

/ g(x)t/x = - (g(l) + g(0)) - ^ | + l v ( g (Ar)(1 ) _ ? W (°)) 

(—\\ M r x 

+ M , J B M (x)g (M) {x)dx. 

Therefore, 

1 1 M — 1 d 

^ te(l) + g(0)) = / g(x)dx + ^ + + i\^ g(t) ^^ _ 

i_n M r 1 

B M (x)g {M] (x)dx. 

Now, we take f(x + n) (a < n < b — 1 ) for the function g(x). Since the function 
Bm(x — [x]), which is obtained by extending Bm(x ) from the interval [0, 1] to the 
entire real line by periodicity, satisfies Bm (x — n) = Bm (x — [x]) for any integer n 
and n < x < n + 1 , we obtain 

1 / , « + l M— 1 j) 

- (f(n + 1 ) + f(n)) = / f(x)dx + 7Trr7d ( f (k> (" + ’)- / ( * } («)) 

2 Jn “ (k + 1 )! 

JfT J b m(x - [x])f (M) (x)dx. 

The formula in the theorem is obtained by adding up the above formula from n = a 
to n = b — 1 and then adding \ if (a) + f ( b )) to both sides. □ 


5.2 The Riemann Zeta Function 

The exponential function e z with the complex variable z, is defined by the formula 

z 2 z n 

e z = l +z+ + ... + 

2! n\ 

The power series on the right-hand side converges absolutely for any z e C, and 
thus it is a holomorphic function over the entire "-plane. From the formula for the 
product of two power series, we have the exponential law 

e z+w = e z e w . 

In particular, for x, y e R, we have Euler’s formula 

e x+iy = e x e' y = e*(cos y + i siny). 
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For any positive real number a, the complex power a s for s e C is defined by 


a 


S 


— gSlog a 


Its absolute value is given by 


a = e 


o log a 


where a = Ref.v). 

Let 5 be a complex number. Consider the sum 


N 1 
T - 

„s 


n = 1 


1 + — — h 

2 s 


+ 


N s 


Here, n s is a complex power of n just defined above. Let / : (0, oo) — >• C be the 
function defined by 


f(x) = x s . 


We apply the Euler-Maclaurin summation formula to this fix) with a = \,b = N . 
The Ar-th derivative of / is given as 

f (k \x) = i-s)i-s - 1) • • • i-s - k + \)x~ s ~ k 
= (— 1 ) k sis + 1) • • • is + k — l).v _s_ir . 


Also, if s 1, then we have 


It follows from Theorem 5.1 that for .v / I 




M-l 


B k+X (s + k-l\( 1 \ 

k + l' y k J V N s+k ) 


(s + M - 1 
\ M 



N 


Bm(x — [x])x s M dx. 


(5.1) 


where ( is the binomial coefficient x(x — 1 ) • • • (x — / + 1 ) / i ! , and we used the 
fact (— \) k B k +\ = —B k +\. If i = 1, then it follows from x~ l dx = log N that 
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n= 1 V 7 k= 1 V 7 

/ N 

Bm(x ~ [x\)x~ l ~ M dx. 

As an application of these formulas, we obtain the following elementary result. 

oo 1 

Proposition 5.3. (1) The series} — converges absolutely if Re(.y) > 1, and the 

' n s 

n = 1 

oo 1 

series > — diverges. 

— ' n 
11= 1 

N 1 

(2) Let {a is/} be the sequence defined by = / — log N. Then {a^} converges. 

/ —‘n 

n= 1 

77?e limit y = lim w called Euler’s constant } For any natural number M , 
N-+o o 

we /iflve 


(5.2) 


1 75 /.C 

\ - -pa-H _ / 

£o * +1 A 


~ M) 

JC M+1 


dx. 


In particular, setting M = 1, we /tflve 


y = 1 


i; 


x — [x] 


dx. 


(5.3) 


jv ' j ' v ' j 

Proof. (1) Let Re(.v) = cr. Since > — = ) — , by putting s = o , M = 1 

in formula (5.1), we obtain 


n * z ' n° 

n = 1 n = 1 



/7 — 1 


— (i — b,( X - 

-iv (v a_1 / 2 v w j. 


[x])x 


-CT— 1 


t/x. 


Since tr > 1, both 1/A CT_1 and 1/A CT converge to 0 as A" — > oo. As for the 
integral in the last term, first note that the function Bm{x — [a] ) is bounded 
since 0 < x — [a] < 1 . Thus, if 


/ 


OO 


x a 1 dx 


pN 

lim / x~ a ~ l dx 

J 1 


(5.4) 


3 It is unknown whether y = 0.5772156649015328606065120900824 • • • is an irrational number 
or not. 
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converges, then so does 




Bm(x — [x\)x a 1 dx 


as N —*■ oo. The integral (5.4) converges since if cr > 0, we have 


lim 

TV— >00 


/' 


x a 1 dx 


1 ( 1 \ 1 

= lim —II 1 = — . 

n-*oo a \ N a ) a 

AA 1 

Summing it all up, we see that > — converges absolutely if o > 1 . Also, the 


n = \ 


AA 1 

fact that > — diverges follows immediately from (5.2). 

Z ✓ 17 


(2) It follows from the formula (5.2) that 


A 1 1 / 1\ A B k+ 1 / 1 

n s 2 V (V/ ^ A- + 1 I A 1 ' 

/7=1 AC= 1 


Bm{x -=*■ [x])x 1 


Similarly to (1), the last integral converges, and thus { a ; y } converges. 
Letting A —> oo, we obtain 


y = lim a at 


N-^-oo 


1 ^ ^ D /* OO 

= ~ + J2 rzi ~ Bm(x- [x])x~ x ~ m dx 
1 k = 1 + 1 71 

^ — 1 D /»C 

_ ^+1 _ / 


Bm(x — [x\)x dx 


(Here we used Si = 1/2.) If M = 1, then we have B\ {x — [x]) = x — [x] — \ and 


r°° dx 

■ r 

h 

x _ 


= 1. 


Thus, we have 


-u 

-'-r 


X - [x] - 5 


dx 


x — [x] 


dx. 


□ 
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The famous Riemann 4 zeta function £ ( 5 ) is defined for all complex numbers s 
satisfying Re (s) > 1 by 


Us) 


00 

T-. 

Z— * „s 


n = 1 


As we have already seen, the infinite series on the right-hand side converges 
absolutely for Re(.v) > 1. If a is a positive number greater than 1, then this series 
converges uniformly in Re(.v) > rr, and thus it is a holomorphic function of s in 
Re(s) > 1. Let us study properties of £ (s ) , using the summation formula (5.1). 
Suppose Re(s) > 1, and let N — > 00 in (5.1). Then, 



11 = 1 


1 1 ^ 
^T + 2 + ^ 

i-=i 


Bk+i (s + k- 1\ 
k+ l' ( k ) 



Bm(x — [ x])x s M dx. 


(5.5) 


Now, take the difference between (5.1) and (5.5). Then, for Re(s) > 1, we have 
N , „ „ M - 1 


111 1 

U S > ~ 2^ ~s - — 1 ' fl/s-l - 

n=\ k= 1 


Bk + 1 

k + 1 


<s + k~l 
l k 


1 


's + M - 1 
, M 


)/; 


Bm(x — [x])x 4 M dx. 


]\fs+k 

(5.6) 


Since the integral of the right-hand side converges absolutely in the region Re(s) > 
1 — M, the formula (5.6) gives an analytic continuation of £ (,y) to a meromorphic 
function in the region Re(s) > 1 — M . In other words, £ (,s) can be extended to 
Re(.s) > 1 — M through the formula (5.6). Since M can be any natural number, 
%(s) has an analytic continuation to the entire .v-plane as a meromorphic function. 
It also follows from (5.6) that its only pole is a simple pole located at 5 = 1 , and the 
residue at s = 1 equals 1 . 

We summarize all this, and find the values of %(s) at integers less than or equal 
to 0. 


Theorem 5.4. (1) The function £ (.s) has an analytic continuation to the entire 
s-plane as a meromorphic function, is holomorphic if s 1, and has a pole 
of order 1 at s = 1 with residue 1. 


4 Georg Friedrich Bernhard Riemann (bom on September 17, 1826 in Breselenz, Germany — died 

on July 20, 1866 in Selasca, Italy). 
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(2) Let m be a positive integer. The value of£(s)ats = 1 — m is given by 

B„ 


m 


Proof. Since we have already seen (1), we prove (2). If s = 1 — m, taking M in 
(5.6) with M > m, the binominal coefficient in front of the integral on the right- 
hand side vanishes. This implies that £ ( 1 — m) is expressed as a finite sum. Our goal 
is to express it in a simpler form. In (5.6), let s = 1 — m, and M = in. Then we 
have 

N Aim — 1 m—l „ 

m—l M Is B k+ 1 

$a-m)-y n = — i- y. 7r~nv\ 

' m 2 z — ' (k + 1) 

n = 1 Jfc=l v ' 

x (1 — nt)(2 — in) ■ ■ ■ (k — m) ■ N 
If m = 1 , then there is no term of summation, and 


m—k — 1 


(5.7) 


N 11 
?(0) = E l-N--=-- = -B ] . 


n — 1 


Suppose m > 2. Now, by using the formulas (1 — m)( 1 — m) • • • (k — m) = 

(-l)*fc!(V). *fr(V) = dbra iffc < « - 1. and (-1)*^ = B k for 

k > 2, we have 


E 

ir = l 


#A+1 

(*+D! 


(1 — m)( 2 — m) ■ • • (k — m) • N‘ 


i—k — 1 


m— 2 

£=1 
m—l 

= E(-d 


k B k+ 1 (m-\ 
k , 


m — V 
k + 1 , 


B k + 


l\pn-k - 1 
jym—k—1 


+ ( -i) m -' — 
777 — AT — 1 771 


xAr-1 I 


k=2 


m — 1 
k 




/?7 


-it 


m—l / 

= -vr 


E x 

A=2 \ 


771—1 

\ k 


I _ tf"-* B m 

&k j 

' m — k m 


m 


Therefore, the right-hand side of (5.7) equals 


"(v' 


-E 

&=o \ 


- 


AT' 


7— A: 


— k 


Bm 

m 
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(since Bq = 1 , fi| = 1/2). On the other hand, from the formula for the sum of 
powers (1.1) onp. 1, we have 


N 

n = 1 


m — 1 


E 


m - l\ N m ~ k 
k J k m — k 


(5.8) 


Thus, from (5.7) we have the formula 

•rl \ \ Em 

t(l - m) = . 

m 


□ 

Remark 5.5. The value in (2) was first given by Euler ([32], see also WeiP [103]). 
The values of the Riemann zeta function at positive even integers in Corollary 4. 12 
(P- 61) 

£(2 k) = ' (2k)\ (2ir)2k {k € N) 

were also computed first by Euler (ibid.). There is not much known about the values 
at positive odd integers. It is conjectured that all £(2 k +1) are transcendental 
numbers. 5 6 Apery 7 [5] proved that £(3) is an irrational number, and Rivoal [80] 
proved that there are infinitely many irrational numbers among £(2 k + 1 ) (k = 
1,2,3,...) (For more recent results, see [99].) 

Remark 5. 6. The formula (5.8) for the sum of powers can also be obtained using the 
Euler-Maclaurin summation formula (Theorem 5.1). Indeed, the sum i n m ~ l 
can be obtained by letting f(x) = x nv 1 , a = 1. and b = N in the summation 
formula. 

Exercise 5.7. Use (5.6) (N = M = 1) and (5.3) to prove 

limKCs) E) = y. 

s ^ 1 V S ~ 1 J 


Exercise 5.8. Use (5.6) (N = M = l) to deduce the formula 


p oo 

= s / 

Jo 


x - [x] - \ 
x i+l 


dx 


5 Andre Weil (bom on May 6, 1906 in Paris, France — died on August 6, 1998 in Princeton, USA). 

6 Since jr is a transcendental number [71], f(2 k) are all transcendental numbers. 

7 Roger Apery (bom on November 14, 1916 in Ruen, France — died on December 18, 1994 in Caen, 
France). 
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which is valid for — 1 < Re(,y) < 0. 

Exercise 5.9. Use the previous exercise and the formula (proved in Chap. 4, (4.5)) 

r 1 ^ sin27T«x 

x-[x]~ - = - > 

2 z — ' nn 

n = l 

to deduce the functional equation of the Riemann zeta function 

jr c 

C(s) = —2 s S7t s ~ l r(—s) sin — £(1 — s). 

(This way of proof is rigorously described in Chap. 2.1 in [90], We give a different 
proof of (the symmetric form of) the functional equation in Chap. 9.) 

Exercise 5.10. Carry out the computation suggested in Remark 5.6. 


Chapter 6 

Quadratic Forms and Ideal Theory 
of Quadratic Fields 


There are close relations between Bernoulli numbers and quadratic fields or 
quadratic forms. In order to explain those, we give a survey of the ideal theory 
of quadratic fields and quadratic forms. Since Gauss, it is well known that there is a 
deep relation between the ideal theory of quadratic fields (i.e. quadratic extensions 
of the rational number field) and integral quadratic forms. This is obvious for 
specialists, but textbooks which explain this in detail are rare. In most textbooks, 
they treat the correspondence of ideals only in the case of maximal orders (the 
ring of all integers) for which the description is simple and easy. This is very 
disappointing. We cannot see the whole picture of the theory of quadratic forms by 
such a restricted treatment and sometimes it causes misunderstanding. We need the 
full description of this kind of relation when we explain later the relation between 
L-functions of prehomogeneous vector spaces and the Bernoulli numbers. So it 
would be a good chance to try to explain the full relation here. 


6.1 Quadratic Forms 


For integers a, b, c and variables x, y, we call the homogeneous polynomial 
Q(. x, y) = ax 2 + bxy + cy 2 


of degree two an integral binary quadratic form (or a quadratic form for short, since 
we treat only the integral and binary case in this book). In particular, in case a, b, c 
are mutually coprime (namely the greatest common divisor, g.c.d., of a, b, c is one), 
we say Q(x, v) is primitive. If we define the matrix S by 


5 = 


/ a b/2\ 
\b/2 c ) ' 


T. Arakawa et at., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_6, © Springer Japan 2014 
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then we can write 


Q(x,y) = (. x,y)S . 

Often this expression helps easy understanding. A symmetric matrix like S whose 
diagonal components are integers and the other components are half-integers 1 is 
called a half-integral symmetric matrix. We denote by L* the set of all half-integral 
symmetric matrices of order two: 


L* 



a,b,c e Z 


This set corresponds bijectively to the set of all integral (binary ) quadratic forms. 
When the corresponding quadratic form (x, y ) S ( X ) is primitive, we say that S is 


primitive. We consider the following groups of integral matrices: 


GL 2 (Z) = {g€ M 2 (Z); det(g) = ±1}, 
SL 2 (Z) = {geM 2 (Z);det(g)= 1}, 


where M 2 (Z) is the set of all integral 2x2 matrices. The group SL 2 (Z) is called the 
(full) modular group and this is the subgroup of GL 2 (Z) of index two. The group 
GL 2 (Z) (and hence also SL 2 (Z)) acts on L* from the right by 

L* 3 S \-3- det(y) ‘ySy e L* (y e GL 2 ( Z)). 

Two elements of L* or the corresponding quadratic forms are said to be SL 2 (Z)- 
equivalent, or just equivalent, when they belong to the same SL 2 (Z)-orbit under this 
action. Two elements of L* are said to be GL 2 (Z) -equivalent when they belong to 
the same GL 2 (Z) orbit by the above action. (In some books, SL 2 (Z)-equivalence 
is called a proper equivalence, but we do not add the word “proper” for this in this 
book). Usually we call the number b 2 — 4a c = — det(2.S) the discriminant of the 
quadratic form. We denote this number by D(S). It is obvious that D(S) = 0 
or 1 mod 4. On the other hand, if D is an integer such that D = 0 or 1 mod 4, 
there exists S e L* such that D(S) = D. The discriminant is invariant by the 
action of GL 2 (Z). The cases where D is a square (i.e. a square of integers, e.g. 
0, 1, 4, 9 ... ) are exceptional, so in this chapter, we assume that D is not a square 
unless otherwise stated. We fix such a non-square integer D = 0 or 1 mod 4 and put 
L*(D) = {S' € L*; D(S ) = D, S primitive}. Then it is known that both SL 2 (Z)- 
equivalence classes and GL 2 (Z)-equivalence classes in L*(D) are finite. (The proof 


'Here half-integers mean their doubles are integers. 
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of the finiteness of the class number and an explicit formula for the class number 
will be given later.) When D ( S ) > 0, we call these numbers of equivalence classes 
the class number and the class number in the wide sense of quadratic forms of 
discriminant D for SL 2 (Z)-equivalence and for GL 2 (Z) equivalence, respectively. 

When D(S) < 0, we have ac > 0, and this means that S is positive definite 
or negative definite. We denote by L * ( D ) the subset of positive definite quadratic 
forms of L*(D ) and by L*_(D ) negative definite forms. When D(S) < 0, we say 
that the number of SL 2 (Z) equivalence classes in L* + (D ) the class number of the 
quadratic forms of discriminant D. In this case, the number of GL 2 (Z) equivalence 
classes in L*(D) and the number of SL 2 (Z) equivalence classes i n L * (D) are the 
same. Indeed this is proved as follows. Any S e L*_(D) is GL 2 (Z) equivalent to 


an element of L/(D) since for y 



, we have det (yYySy = — r ySy e 


L* + (D). If Si and S 2 G L/(S) are GL 2 (Z) equivalent and S 2 = det(y)' yS\y for 
y e GL 2 (Z), then since S 2 and ' y.S\ y are positive definite, we have det(y) > 0, 
hence det(y) = 1, and Si and S 2 are SL 2 (Z) equivalent. 


6.2 Orders of Quadratic Fields 


Now we review the theory of quadratic fields, but there are many books describing 
this theory, so here we write only briefly. Also, we describe the theory mostly from 
the viewpoint of quadratic forms. In some parts this makes our explanation more 
complicated. But this is still useful since this kind of explanation is seldom found in 
books nowadays. 

When m is a non-square integer, we say that K = Q( y/m) = Q + Q -JTn is a 
quadratic field. When m > 0, we call K a real quadratic field and when m < 0, we 
call it an imaginary quadratic field. If m has a square factor, we can bring this factor 
outside the square root, so we can assume that m is an integer without square factor. 
We assume this from now on. An element a in this field K satisfying a 2 +ba+c = 0 
for some rational integers b, c (i.e. integers in the usual sense) is called an integer 
of K . The set O max of all integers in K form a ring. If we put 


c o = 


1+ y™ if m = 1 mod 4, 
y/m otherwise, 


( 6 . 1 ) 


then one can show O max = Z + Zo>. The ring D max is called the maximal order (or 
the ring of all integers) of Q (y/m). 

If we define the linear map a a of a € K to a € K as Q-vector space by 
a + b-J~m = a — by/m (a, b e Q), which is called a conjugation, then this is an 
automorphism as a field. For any element a, of K, we write N (a) = act, T r(a) = 
a + a, the former is called the norm and the latter the trace of the element a. 
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We put Dk = (co — co) 2 = T r(co) 2 — 4 N(co) and call it the discriminant, or the 
fundamental discriminant of K. More concretely we have Dk = m if m = 1 mod 4 
and Dk = 4 m otherwise. 

The maximal order D max is neither a principal ideal domain, nor a unique 
factorization domain in general, so it is known that we cannot define a useful prime 
factor decomposition of elements of D max . But if we consider ideals of D max instead 
of elements, any ideal of 0 max is factored uniquely into a product of prime ideals 
of £> max . This fact is a direct consequence of the fact that D max is a Dedekind 
domain. But sometimes it is useful to treat subrings of D max consisting only of a 
part of integers. For example, it is inevitable to treat such rings when we consider the 
correspondence with quadratic forms. But such subrings are not Dedekind domains 
in general, and we must keep in mind that the prime ideal decomposition does not 
hold for such rings in general. Now any subring O of D max containing 1 is a free 
Z-module since any subgroup of a finitely generated free abelian group is free by 
the fundamental theorem of finitely generated abelian groups. When the rank of O 
is two, we call it simply an order. Let us describe all such rings. 

Lemma 6.1. Let co be as in ( 6 . 1 ). For an arbitrary natural number f , we put 

D f = Z + fcoZ. 

Then this is an order of Conversely, for any order D ofQ(*/m), there exists 

a natural number f such that 0 = 0/. 

Proof. To show that Z + / coZ is an order, we must show that this is a subring. This 
is shown by the following calculation. 

YYl — 1 

f(fco) H — f 2 if m = 1 mod 4, 

f 2 m otherwise. 

Conversely, if O is an order, then since it is of rank 2, the ring O contains an element 
which does not belong to Z. So there is an integer a and non-zero integer b such 
that a + bco G O. Since we have 1 e O by definition, we have a G O so bco G O. 
We take the smallest positive integer / among those b such that bco G D. All the 
other such b are divisible by /. Indeed, for b such that bco G O, we divide b by f 
and write b = fx + bo, x G Z, 0 < b 0 < f . Then we have boco = ( b — f x)co = 
bco — x{fco) e D. By our choice of /, we must have bo = 0, and b is divisible 
by /. Hence we have D = Z + fcoZ = D/. □ 

Orders D / for different / are different rings. If / and f are natural numbers, then 
it is easy to see that £)/' C Of if and only if / 1 f. This ring can be also written as 


If co) 2 = 


L) / — Z + / 0 max 
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We call / the conductor of the order O /. The conductor of an order is 1 if and 
only if it is maximal. We define the discriminant of the order Of by D k f 2 ■ It is 
conventional to call non- zero ideal o of 0/ simply an ideal of Of . Since an ideal a 
is a submodule of Of, it is a free Z-module, but since it is also a module over D /, 
the rank is two. So the ring D//a is a finite ring. The size of this finite ring is called 
the norm of a and denoted by N(a). We describe the free basis over Z of the ideal 
a of Of more concretely. We say that an ideal a is primitive if it cannot be written 
as a = lb for any integer / > 1 and any ideal b of Of . 

Lemma 6.2. For any ideal o of O /, there exist positive integers a, e, d with 
—a/2 < d < a/2 such that 


a = Za + Z (d + efco). 

Here the integers a , b , d are uniquely determined by a. If a is a primitive ideal, then 
we have e = 1 and the norm of a is a. 

Proof For a while, we take an arbitrary ideal a of Of and do not assume that it is 
primitive. Since the module D//a is a finite module, there is some natural number 
m such that ml e a and we have aflZ 0. So we take the smallest positive rational 
integer a in a. Then it is clear that a Z = a fl Z. On the other hand, we denote by 
e the smallest positive integer among those y such that x + yfco e a for some 
integer x. We take d e Z such that d + efa> € a. Then we can conclude that 

o = Z a + Z (d + efco). 

This can be seen as follows. First, if x + yfco e a for some x, y e Z, then y is 
divisible by e. Indeed, if y = eq + r with 0 < r < e and 0 < q, then we have 
x + yfco — q(d + efco) = (x — qd) + rfco e a. Flere if r 0, it contradicts 
the definition of e, so we have r = 0. Flence there is an integer z such that (x + 
yfco) — z(d + ef co) = x — zd e Z n a, which is a multiple of a. Hence we have 
a = Za + Z(d + efco). By adding a multiple of a to d + efco if necessary, we 
may assume that d satisfies —a/2 < d < a/ 2. If we take a, e and d as above, 
then it is clear by our choice that the integers a, d, e are uniquely determined by 
the ideal a. Now we show that if a is primitive in addition, then we have e = 1 
in the above. Indeed, since n is an ideal of Of, we have fata C a and hence 
afco e o. So we have afco = xa + y(d + feco) for some integers x, y. This 
means a = ye and xa + yd = 0. So a/e = y e Z and d/e = —x e Z. 
So if we put b = Z (a/e) + Z (d/e + fco), then we have a = eb. Since a is an 
ideal of Of, we have e(ab) = aa C n = eb, and hence we have ab C b. So 
b is also an ideal of Of. Since we assumed that a is primitive, we have e = 1. 
Since Of = Z + Z fco = Z + Z(d + fco) and a = Z a + Z(d + fco), we have 
Of/ a = Z/aZ and the norm of the primitive ideal a is a. □ 

We call ( a,d + efco) in the above lemma the standard basis of the ideal a. 
Although we described the standard basis of an ideal above, we are not claiming 
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that the module M = Za + Z (cl + /ft)) is always an ideal for any random choice 
of a, d G Z. The module M is an D /-ideal only when a, cl, f satisfy some special 
conditions. We shall describe these conditions below. 

Lemma 6.3. The module M = Za + Z(d + fed) is an ideal of D / if and only if 

a\N(d + /to). 

This is equivalent to saying that there exists an integer c such that 

h 2 — f 2 DK = 4 ac, 
where we put b = Tr(d + /ft)). 

Proof All we need to do is to examine the condition fcoM C M . We have /ft) x 
a = ( —d)a + a(d + fco) g M . As for fco(d + /ft)), we have 

fco(d + fco) = (d + Tr(fco))(d + fco) — N(d + fco). (6.2) 

So this belongs to M if and only if N(d + fco) is divisible by a. Since we have 

b 2 - f 2 D K = (2d + fTr(co)) 2 - f 2 (Tr(co) 2 - 4 N(co)) 

= 4 (d 2 + dfTr(co) + f 2 N(ca)) 

= 4 N(d + fco). 


this is equivalent to say that 


b 2 — f 2 D K = 4 ac 


for some c G Z. □ 

Remark 6.4. (1) Notation being the same as in the last lemma, we have 


d + fco = 


b 4~ \J T) 
2 


where D = f 2 DK'- the discriminant of D/. 

(2) The norm a of a primitive ideal and the conductor / might not be coprime. For 
example, the module Z5 + Z5\/2 is a primitive ideal of the order £>5 of the 
quadratic field K = Q(\/2) with conductor 5 and its norm is 5. We can write 
Z5 + Z5 V2 = 5(Z + Z\[2), but Z + Z\fl is not an ideal (nor a subset) of O 5 . 

For an ideal a of Of, the ring {x G K; xa C a} is a subring of Q max which 
contains D /, so it is O f for some divisor f of /. When this coincides with D f, 
that is, when / = /', we say that a is a proper 0/-ideal. (In the usual ring theory. 
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the term proper ideal is often used to indicate the ideal which is not the whole 
ring itself. Note that our usage here is different from that.) Next, we shall see the 
condition that a primitive ideal of D / is a proper D /-ideal. 

Lemma 6.5. For a primitive ideal a of D /, we write a = Za + Z (d + fed) by the 
standard basis and define integers b, c as in Lemma 6.3. Then the ideal a is a proper 
D f -ideal if and only if ged of a, b, c is one. 

Proof. Assume that a is an Df module for a divisor f of /. If a is an Dp 
module, then afw G a, hence we have afw = xa + y(d + fw). Comparing 
the coefficients, we see a = y(f / f) and xa + yd = 0. But since a > 0, 
we have y f 0 and so d = —x(f/f). Hence we have a.d G (f/f) Z. So 
bf'/f = df'/f + f'Tr(w) G Z and b G (f/f) Z. Multiplying the Eq. (6.2) in 
the proof of Lemma 6.3 by f/f , we have 

fm(d + fa >) = ( fd/f + fTr(w))(d + fw) -acf/f 

Since fw(d + fw) g a, we have cf/f g Z, i.e. c g f/f Z. So if f < f, 
then the g.c.d. of a, b, c is bigger than one. Conversely, assume that n is the greatest 
common divisor of a , b, c. We first show that n always divides /. Indeed, write 
a = nao, b = nbo, c = nco . Then b 2 — 4ac = n 2 (bl — 4«oCo) = / 2 -Da:- If Dk 
is odd, then Dk does not contain any square factor, so we have n \ f . If Dk is even, 
then Dk = 4 m, where m = 2 or 3 mod 4 and m is square-free. If n is odd, then 
by the same reason as before, we have n\f. Assume n is even and n = 2«o- Then 
we have «o I /■ If // «o is odd, then (/ / nffm = 2 or 3 mod 4 but this is equal to 
b^ — 4aoCo which is 0 or 1 mod 4 and we have a contradiction. So // n o is even 
and we have n\f. So if we assume that n > 1 and define f by nf = /, then 
f < f and a, b,c e (/ //') Z. Then we also have d e (/ / /') Z. Indeed we have 
(f/f)\b = Tr(d + fa>) = 2 d + fTr(co) and ( f / f) 2 \ac = N(d + fio) = 
d 2 + 2dfTr(co) + f 2 N(co). So we see that (f/f) 2 divides 

(2d + fTr(m)) 2 -2 (d 2 + 2dfTr(w) + f 2 N(co) = 2d 2 + f 2 (Tr(co) 2 - 2 N(co)). 

Since f/f divides /, we have (f / f ) 2 \2d 2 . So we have (f/f)\d. So we have 
a, b, c, d e (// f) Z. Hence we see that afeo = (a f / f)fu> = —a(df/ f) + 
C af/f)(d + fat) e a and fco(d + fw) = (df/f) + f'T r(w))(d + fw)fw - 
ac(f / f) G a, so a is an ideal of D //. □ 

Lemma 6.6. If the discriminant of a quadratic form ax 2 + bxy -(- cy 2 is not a 
square, then there exists a quadratic field K and a natural number f such that 
b 2 — Aac = f 2 D K . 

Proof. We have b 2 — 4 ac = 0 mod 4 or 1 mod 4. So we can write b 2 — 4 ac = 
2 e ffmo for some odd natural number f, some odd number mo with no square 
factor and some non-negative integer e. If e = 0 then m o = 1 mod 4. Since 
we assumed that b 2 — 4 ac is not a square, the field K = Q(fho) is quadratic 
and Dk = mo- Then Dk and f = fo satisfy the demand. If e f 0, then 
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b 2 — 4 ac = 0 mod 4, so we have e > 2. If e is odd, then put K = Q(f2mo). 
Then we have Dk = 8/?to and / = 2^ e ~ 3 ^ 2 fo satisfies the demand. If e is even, put 
K = Q(^/mo).lfino = 1 mod 4 then we have Dk = mo, and then put / = 2 e / 2 /o. 
If m o = 3 mod 4 then we have Dk = 4;«o and put / = 2 <e_2, ' /2 / 0 . These satisfy 
the demand of the lemma. □ 

This lemma suggests that there is a deep relation between binary quadratic forms 
and quadratic fields. For each proper primitive ideal a of an order of a quadratic 
field, using a, b, c of Lemma 6.3, we define a quadratic form Q a by 

Q a (x , y) = ax 2 + bxy + cy 2 . 


We can also write this as 


/ b \J D \ / b + \f~D 

Q a (x,y) = N(a) N ixa + y 1 = aN I x + y — — — 

Here we put D = b 2 — 4 ac, and a and b+ ^ = d + fa) form a basis of a. 

Now in order to see relations between equivalence classes of quadratic forms 
with some equivalent classes of ideals of a quadratic order, we give the following 
definitions. We say that two proper O / -ideals a and b are equivalent in the wide 
sense if there exists a e K x such that a = (a)b. If N(a) > 0 in the above in 
addition, we say that a and b are equivalent in the narrow sense. Here the condition 
N(a ) > 0 is satisfied automatically if K is an imaginary quadratic field. If K is a 
real quadratic field, then taking —a instead of a if necessary, we may replace the 
condition N(oe) > 0 by the condition a > 0 and d > 0. Actually this is the usual 
definition which fits into the general theory. The number of equivalence classes in 
the wide (narrow) sense of proper D /--ideals is called the class number in the wide 
(narrow) sense of Of. It is common to call the number of equivalence classes in the 
wide sense just the class number. If K is an imaginary quadratic field, the definition 
in the narrow sense and that in the wide sense are the same, as we saw already. 

Now let K be a quadratic field and let D k be the fundamental discriminant of K. 
Let / be a natural number. We define a mapping from the set of proper primitive 
ideals a of D/ to the set of quadratic forms by setting Q a (x , v ) = ax 2 + bxy + cy 2 
by using the standard basis such that a = Za + Z (d + fad), where we define b, c 
by b = 2d + T r{co), N(d + fad) = ac. Since the standard basis is unique, this is 
a well-defined mapping from ideals. 

This mapping does not give a bijective mapping of proper primitive ideals to 
primitive quadratic forms. But we can prove the following claim. 

Theorem 6.7. When K is an imaginary quadratic field, that is, if Dk < 0, then the 
above mapping induces a bijective correspondence between the set of equivalence 
classes in the narrow sense of proper O j ideals to the set of SLi(Z) -equivalence 
classes of positive definite primitive quadratic forms with discriminant f 2 DK ■ If 
K is a real quadratic field, that is, if Dk > 0, then this gives a bijection from 
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the set of equivalence classes in the narrow (wide) sense of proper Of ideals to 
the set of SL 2 (Z,)-equivalence ( GLofZd-equivalence) classes of indefinite primitive 
quadratic forms with discriminant f 2 Dx- 

Proof. For simplicity, we write L(D) = l*(D) if D > 0 and L ( D ) = L * ( D ) 
if D < 0. We denote by L(D)/ ~ the set of GL 2 (Z) equivalence classes in L(D) 
and by L(D)/ ss the set of S Li(7.) equivalence classes. Since any Of ideal is 
equivalent (both in the wide sense and narrow sense) to the primitive ideal, it is 
sufficient to consider primitive ideals. For a given proper primitive D / ideal, we 
define a, b, c as before. Then since (a. b.c) = 1, the quadratic form 0 a (x , y ) = 
ax 2 + bxy + cy 2 is primitive and the discriminant of O a is f 2 D k ■ By our choice 
of the standard basis of a, we have a > 0. So if D(S) < 0, then Q a (x,y) is 
positive definite. So the mapping a to Q a (x,y) induces mappings to L(D)/ ~ 
and to L(D)/ ta. We first show that this mapping is surjective. Take a primitive 
quadratic form Q(x,y) = ax 2 + bxy + cy 2 . If this is positive definite, then we have 
a > 0 automatically. If this is indefinite, then we might have a < 0, but changing the 
quadratic form by SLofL) equivalence, we may assume that a > 0. This is proved 
as follows. Assume that a < 0. For a given x, y, we have 4 a(ax 2 + bxy + cy 2 ) = 
(lax + by) 2 + (4ac — b 2 )y 2 . We have 4oc — b 2 < 0 by the assumption that the 
quadratic form is indefinite. We fix an integer y such that \a\/ \lb 2 — 4 ac < y. 
Choosing x suitably, we can assume that lax + by is in any set of representatives 
modulo la. So we can assume that \lax + by \ < \a\. So we have (lax + by) 2 — 
(b 2 — 4 ac)y 2 < 0 for some x. This means that there exists (x.y) G Z 2 such 
that ax 2 + bxy + cy 2 > 0. This property still holds if we divide x, y by the 
g.c.d. of x and y, so we may assume that x and y are coprime. We can take an 
element A of 5L2(Z) such that (x , y) is the first row of A. So the (1,1) component 
of A ^ h “ 2 b ^ 2 J 'A is positive. This proves the assertion. We assume a > 0 from 

now on. Let D be the discriminant of Q , that is, D = b 2 — 4 ac = f 2 DK- Since we 
have (fTr(co)) 2 = f 2 (DK + 4N(u))), we have (fTr(co)) 2 = / 2 Da' = b 2 mod 4 
and we have b = fT r(a>) mod 2. So if we define d by b = Id + fTr(co), then 

b + VD 

o = Z a + Z (d + fco) = 7 jCI + Z 

is a proper primitive ideal of 0/ since we assumed gcd(a. b,c) = 1. This d might 
not satisfy the condition —a/1 < d < a /l, but changing b to bo = b — lak for 
some k, we see that do = d — ak satisfies the condition. So we take k G Z such that 
— a/2 < do = d — ak < a/2. If we define bo = b — lak and cq = c — bk + ak 2 , 
then b} } — 4aco = f 2 Dk and we have aco = N(do + fco), bo = Ido + fTr(co). 
So we have a = Za + Z(<io + fco). We have 

Qa(x, y) = a 0 x 2 + b 0 xy + c 0 y 2 

= a(x — ky) 2 + b(x — ky)y + cy 2 , 
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so this is SL 2 (Z) equivalent (a fortiori, GL 2 (Z)-equivalent) to Q(x, y). So the 
mapping to L(D)/ ~ or L(D)/ ss is surjective. Next we see that these mappings 
to L(D)/ ~ and L(D)/ ss induce well defined mappings from ideal classes in the 
wide sense and in the narrow sense, respectively. Write two proper primitive O / 
ideals by standard bases as a,- = Za, + Z (bi + *J15)/ 2, where D = bf — 4a j c, 
(i = 1, 2), and assume that a 2 = Oicr for some a e Q(\/7)) x . Then there exists 
A e GL 2 (Z) such that ( 02 , (b 2 + s/~D)/2) = (a\a, (b\ + V~D)a/2)A. Since we 
have 


G, g+a/d ' 1 
a 2 5 

h-Vb 


UG — 7 — ; 


a 0 
0 a 


we have a 2 = aiiV(a) det(^4). Writing ( X , T) = (x, y) 'A, we also have 


a 2 (a 2 X 2 + b 2 xy + c 2 y 2 ) = N I a 2 x + 


b 2 + \/~D 


-y 


= cnN(u)(aiX 2 + b\XY + c x Y 2 ) 

= a 2 det(A)(aiX 2 + b\XY + crT 2 ). 


Since «, > 0 for i = 1, 2, we have N(a) det(A) > 0. So we are done. Next 
we see that the mapping is injective. For two primitive proper Of ideals Oi = 
Zfli + Z(d\ + fed) and a 2 = Za 2 + Z(d 2 + fco), we define 

Qai(x,y) = a\x 2 + b\xy + c\y 2 = a^ 1 N{a\x + {d\ + fu>)y ) and 
Q a 2 (x, y) = a 2 x 2 + b 2 xy + c 2 y 2 = af l N(a 2 x + (d 2 + f(d)y), 

as before. Assume that Q ai (x,y) = det(A)Q ai ((x, y)A) for some A e GL 2 (Z). 
We define co\, co 2 € K = Q (*/TJk) by 


(co\,co 2 ) = (1 a\,d\ + fco ) 1 A = (a\, — — r— — 'i ' A- 


and put a = a 2 /u>\. We will show that 02 = crai and det(A)A(a) > 0. Since 

«i A „ ( «i \ _ (vi 


a ‘ x + ^ + My = <*• y) U + /*>)• and A v, + /<» 


CO 2 


, we have 


}a\((x,y)A) = a\ l N ({x,y)A ^ = a i 1n (xw x + ya> 2 ) 


a ] i {x 2 N( 0 L>\) + xyTr(a>\a> 2 ) + y z N(co 2 )). 
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Since this is equal to det)A)Q ai (*, y), we have 

N{coi) = d&\.{A)a\ci2 and TrQioicof) = det{A)a\b2. 


So we have 


iV(a) det(. 4 ) = &ei(A)a\/ N (co\) = gq/ai > 0 . ( 6 . 3 ) 

Now, since A G GL2CZ), we have ai = Zaii + ZW2. So we have aia = 
Z«2 + 7 j(a. 2 ( 02 / (»\). So it is sufficient to show that 020)2/0)1 = t/i + fo). We have 
a2(i>2/co\ = d20)2C0\ / N (a>\) = det{A)a)2C0\/a\. We must calculate o)2p)[. Taking 
the conjugate of the definition of or, , we have 

( (Du o> 2 \ _(a\,di + fo)\ , 

\c0uc02J \ai,di+fcoJ 

So taking the determinant, we have 

— CO1CO2 + (Did) 1 = det( 4 )ai/(&) — oJ). 

So adding twi©2+dh®2 = det (^4 )i>2« 1 = det(H)ai(2<j?2 + Tr(/ft>)) to the both sides 
and dividing by 2 , we have = det(y 4 )ai(c ?2 + /cd) = {N {(D\) / a2)(d2 + for). 
So we have to = ci| ». Hence ai and ai are equivalent in the wide sense, and if 
det(, 4 ) = 1 , then equivalent in the narrow sense. Hence the mapping is injective. 

□ 

When D K > 0 , the equivalences of proper Of -ideals in the wide sense and in 
the narrow sense are the same if and only if there exists s e 0 Y , with N(s) = — 1 , as 
we can see easily. We give the corresponding fact in the case of symmetric matrices 
(or quadratic forms) in the following proposition. 

Proposition 6 . 8 . Assume that D = f 2 Dk is positive and is not a square. Assume 
that for Si, S2 € L*(D ) we have S2 = — r B S\B for some B e GL2CZA with 
det( B ) = — 1 . Then there exists C e S L2CT) such that S2 = 1 CS\C if and only if 
there exists s e such that N(s) = — 1 . 

Proof. We may assume that Si and S2 are primitive. First we assume the existence 
of C . Then if we write A = BC~ l e GL2(Z), then we have det(H) = —land Si = 
det(, 4 ) ' , 4 S] A. We have shown in the proof of the last theorem that Si is SZ.2(Z) 
equivalent to a matrix S in L(D) corresponding to a quadratic form Q a (x. y ) for 
some primitive Of proper ideal a = Za + Z (d + for). Now we put (c 0\,a)2 ) = 
(a , d + for ) A and a = a /co\, and apply the argument of the proof of the last theorem 
for ai = CI2 = n. Then by ( 6 . 3 ) we have N(a) < 0 and we have aa = n. Since a is a 
proper Of -ideal, we have a e Of . We also have a = aa - 1 ,soo ! _1 e G/andA^(a) 
and N((x~ l ) are integers. So we have /V ( o' ) = — 1 and a € Of Conversely, if there 
exists € e D* with N(e) = — 1 , then we have ae = a. If we define A G GL2(Z) by 


(a.d + f(o)e = (a,d + for) A, 
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then det(A) = N(e ) = —1 and 0 a ((x, y)A) = det{A)Q a (x, y), that is, AS r A = 
det(A)S. Assuming that Si = PS'P for P € SLiiTA, C = PAP~ { satisfies the 
condition in the proposition. □ 

Example 6.9. (1) If we put K = Q (-Jm) and a = 7m + Z -Jm, then Q a (x, y) = 
\rn\~ 1 N(\m\x + y s/m) = \m\x 2 — sgn(m)y 2 . 

(2) We take K = Q(>/-5). For an ideal n = 3Z + (1 + </-5)Z of D max = 
Z + ZV=5, we have Q a (x, }’) = 3x 2 + 2 xy + 2 v 2 . On the other hand, for 
D max = (1) itself, we have Qx\) = x 2 + 5y 2 . Any primitive quadratic form 
with discriminant —20 is equivalent to one of these two. 

Remark 6.10. In the above, we did not treat negative definite quadratic forms. But 
this causes no problem since all these can be obtained by multiplying —1 to the 
positive definite ones. On the other hand, we excluded primitive quadratic forms 
with square discriminant. These quadratic forms are (as far as discriminant is not 
zero) very special quadratic forms in various senses. (For example, the volume of 
the fundamental domain with respect to the unit group (the automorphism group) 
is not finite etc.) We would like to omit such details, but at least we try to give a 
classification of equivalence classes below. 

Let Q(x, y) = ax 2 + bxy + cy 2 be a primitive quadratic form and assume that 
b 2 — 4 ac = f 2 for a positive integer /. First we see that we may assume a = 0 by 
replacing Q by an equivalent quadratic form. Indeed, if a ^ 0, then noting that the 
equation ax 2 + bx + c = 0 has a solution x = {—b ± f)/2a, we denote by e the 
g.c.d. of — b + f and 2 a and define xo, Vo by —b + f = x^e and 2 a = y§e. Then 
Xo and yo are coprime and there exist wo, zo £ Z such that XqWq — yozo = L Since 

we have a ^ j + b f ^ + c = 0, we get ax$ + bxoyo + cyfi = 0 and 

( Xo >'()A ( a b/2\ fx () zo A _ / ax l + bx 0 yo + cyl *\ _ /0 *\ 

\zo Wo) \b/2 c ) V>'o W 0 ) V * *) V* *) ’ 


So we can assume that 0(x,y) = bxy + cy 2 up to SZ, 2 (Z)-equivalence. If the 
discriminant of Q is 0. then b is also 0, and by the assumption that it is primitive, 
we have c = ±1. It is easy to see that the quadratic forms y 2 and — y 2 are not 
SL 2 (Z)-equivalent but and GL 2 (Z)-equivalent. So we classified the case where the 
discriminant is zero. From now on, we assume that the discriminant is not zero. We 

consider an element A = ( 01 ^ ) e G L 2 (7) which maps bxy + cy 2 to quadratic 

\K sj 

forms whose coefficient of x 2 is 0. Since 

0 b/2\ _ f bay + cy 2 * 

b/2 c ) \ * * 



we should have y = 0 or ba + cy = 0. Since b and c are coprime, we have z, 
w € Z such that bz + cxv = ± 1 . So such an A is given either by 
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A = 



with dett A) = ±1. For the former one, if we take bz + cw = 1, then we have 
det(A) = 1 and 


/ 0 b/2\ ^ _ / 0 —b(bz + cw)/l\ 

\b/2 c J y—b(bz + cw)/2 bzw + cw 2 J 


= det(d) 


( 0 -b/2\ 

\-b/2 w J ’ 


so we can choose a representative of SLi(Z) equivalence classes such that b > 0. 
On the other hand, we have 


/10W 0 b/2\(\z\ = ( o b/2 \ 
\z l) \b/2 c ) lo l) \b/2c + bz) 

so we can take c among representatives modulo b. We have 


/I 0 W 0 b/2\ /I 0\_ / 0 b/2\ 
\0 — 1 / \b/2 c ) \0 —1/ _ \b/2 —cj 


So representatives of .8' /. 2 (Z)-equi valence classes and GL 2 (Z) -equivalence classes 
with discriminant f 2 ^ 0 (/ >0) can be taken in the set of the following matrices: 



with 0 < c < f — 1 and 0 < c < // 2, respectively. The fact that these are not 
equivalent with each other can be seen easily by the above calculation. In particular, 
the number of SL^iZ)- or GL 2 <Z)- equivalence classes of quadratic forms with a 
fixed square discriminant is finite. The number of SL 2 (Z) equivalence classes is /, 
and if we restrict to the primitive classes, the number is (p(f), where (p( f ) is the 
Euler function. The case where the discriminant is not a square will be explained in 
the next section. 


6.3 Class Number Formula of Quadratic Forms 

In this section, first we prove the finiteness of the class number (the number of 
equivalence classes) of primitive quadratic forms. As we mentioned in the last 
section, this is equivalent to the finiteness of the class number (in the narrow sense or 
in the wide sense) of quadratic orders D /, which are not necessarily maximal. Next 
we state a formula of the class numbers (in the wide sense) of quadratic orders. (The 
proof is postponed until Chap. 10.) As we saw in the last section, the class number 
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in the narrow sense is easily obtained from the one in the wide sense. Actually, once 
we know the class number formula for the maximal order, the class number of an 
arbitrary order is determined. The reason for this can be explained if we use group- 
theoretic language, but in Chap. 10, we will give a different direct proof using zeta 
functions. 

Proposition 6.11. The number of S Li(7fi equivalence classes of binary primitive 
quadratic forms of a given discriminant is finite. In particular, the class number of 
binary primitive quadratic forms with discriminant f 2 Dic is finite. 

Proof. We already explained the case where the discriminant is square. So, here 
we treat only those with discriminant f 2 D k, where K is a quadratic field and Dk 
is the fundamental discriminant of K. We fix f 2 DK now and we consider only 
those quadratic forms which have this number as discriminant. We say first that any 
primitive quadratic form is equivalent to ax 2 + bxy + cy 2 with \b\ < |o| < |c|. 
Indeed, we take a quadratic form ax 2 + bxy + cy 2 so that \a\, the absolute value 
of the coefficient of x 2 , is the smallest among those equivalent to the original 
one. Then by the transformation (x, y) — »• (— y, x), the coefficient x 2 becomes c, 
so by our choice of a, we have \a\ < |c|. For some integer m e Z, we have 
— \a\ < b — 2/na < |o|. So, changing the quadratic form to the equivalent one by the 
transformation (x, y) — > (x — my, y), we have a(x — my) 2 + b(x — my)y + cy 2 = 
ax 2 + (b — 2 am)xy + (c — bm + am 2 )y 2 , we may assume that — \a\ < b < \a\ 
from the first. So we can assume that \b\ < \a\ < |c|. (In particular, if D k < 0, we 
may assume that the quadratic form is positive definite, so we have \b\ < a < c.) 
On the other hand, we have b 2 — 4 ac = f 2 DK by definition, so we have 

4|ac| — b 2 < \4ac — b 2 \ = / 2 |Djt|. 

But since b 2 < a 2 , a 2 < \ac\, we have 3 a 2 < 4\ac\ — b 2 < / 2 |Dx-|. So \a\ < 
\] / 2 1 Z) a' | / 3 . It is obvious that there is only a finite number of such integers a. 
Hence the number of b is also finite by | b \ < |a|andc = [b 2 — f 2 DK)/4ct is also. 
Hence the assertion is proved. □ 

Let D k be the fundamental discriminant of a quadratic field and put D = f 2 DK- 
We denote by h(D) the class number in the wide sense of the order 0/ of the 
quadratic field K = Q + Q to. To describe the class number formula, we introduce 
the quadratic residue symbol. For an odd prime p and an integer a, the quadratic 
residue symbol (jj with respect to p is defined as follows. 

if a ^ pTu and there exists x e Z such that x 2 = a mod p, 
if a pTu and there is no x e Z such that x 2 = a mod p, 
if a e pTj. 

The following three relations are known as the reciprocity law of the quadratic 
residue symbols. 
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(1) For odd primes p, q, we have 



(2) For an odd prime p, we have 





= (_!)<>— 1)/2 


(3) For an odd prime p , we have 




The proof will be omitted here. (See e.g. [83].) 

For the fundamental discriminant Dk of a quadratic field and a prime p. we 
define the notation Xk(p ) by 


Xk(p) = 



if p is an odd prime, 
if p = 2 and Dk = 1 mod 8, 
if p = 2 and Dk = 5 mod 8, 
if p = 2 and Dk = 0 mod 4. 


We also define xk( 1) = 1 and 


Xk(~ 1) 


j 1 if D k > 0, 
{-1 if Dk < 0. 


Moreover, if u is an integer and if u = ± ■ ■ ■ p‘‘: is a prime factor decomposition 

of u, we put xk(u) = Xk(±\)Xk(P\Y' ■ ■ ■ XidPr) er ■ Actually, by using the 
reciprocity law and other things, one can show that Xk(u) depends only on 
u mod D K ■ We omit the details of the proofs of this fact (cf. [107, §5]). The function 
Xk prolonged in this way is a primitive Dirichlet character modulo \Dk\. It is 
common to write this character as So in this book, we sometimes use this 

notation instead of xk- 

Now we can write down the class number formula by using the notation 
Xk(u) = (^jp)- When Dk > 0, by Dirichlet, it is known that 0* mx = {±1} x 
{fi"; n € Z} for some e e 0' nu t with f / ± 1 . This e is called the fundamental unit 
of K. 
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Theorem 6.12. (1) The class number of the quadratic field K with discriminant 
Dk is given by 


h(D K ) 


w 

~~D~k 

log(e) 


\Dk\- 1 


Dk_ 

U 


'A\ * / 

£( 

//= 1 v 7 


\og(sm(uji / D K )) 


if D k < 0, 


if D k > 0. 


Here w is the half of the number of units ofD max and when Dk > 0, we denote 
by e the fundamental unit of K. 

(2) The class number of the order O / of K with discriminant D = f 2 DK is 
given by 


HD) 


HD K )f 

[£>max ■ Of] 


p\f v F v 


°t)) 


Here for any ring R, the notation R x means the group of all units of R. 

As we mentioned before, the class number in the narrow sense can be easily 
obtained by this formula, and the class number of primitive quadratic forms also. 
The proof of the class number formula will be given in Chap. 10 when K is an 
imaginary quadratic field. When K is a real quadratic field, the proof will be omitted 
from this book. 

The class numbers of imaginary quadratic fields have a relation to the Bernoulli 
numbers. Indeed, if we put x(H = (^), then by Sect. 4.2, we have 


j \o K \-t 

B U = TT J2 

°K 


so if Dk < 0, we see 


h(D K ) = —wB\ x . 


(6.4) 


(6.5) 


In particular, if p is a prime such that p = 3 mod 4 and p > 3, then it is shown 
by using (6.4) and the reciprocity law of quadratic residues that the class number 
h(—p) of Q (^/-p) is given by 


H-p) = -- 

P 



u. 


( 6 . 6 ) 


By the above formula, we see that h(D) = 1 for D = —3, —4, —7, —8, 
— 11, —19, —43, —67 and —163. It is known that there are no other (fundamental 
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discriminants of) imaginary quadratic fields such that Ii(Dk) = 1. Even if we take 
the non-maximal orders of imaginary quadratic fields into account, there are only 
four more orders with class number one, namely D = —12, —16, —27 and —28. In 
contrast to this, it is conjectured that there are infinitely many real quadratic fields 
with class number one , 2 but no proof is known. 

Remark 6.13. For Si and S 2 G L*(D), we can also define an equivalence class by 
S 2 = ' AS \ A for some A e GL 2 (Z).IfSi and S 2 are SL 2 (Z) equivalent in the sense 
we defined before, then of course these are equivalent in the above sense. But the 
equivalences S 2 = ‘ A\SiA\ and S 2 = — r A 2 S 1 A 2 for some A\ and A 2 € GL 2 (Z) 
with det(^4i) = dct(/4 2 ) = — 1 are different conditions in general. For example, for 
two symmetric matrices Si, S 2 G L*(12) given by 



there do not exist A e GL 2 (Z) such that S 2 = ' AS\ A. This can be seen from the 
fact that the diophantine equation a 2 — 3 b 2 = — 1 does not have integer solutions. 
On the other hand, each corresponds to the symmetric matrix obtained from the 
standard basis of 3Z+ V3Z or Z + Zv^3. Since V3(Z + ZV3) = 3Z+V3Z, these 
ideals are equivalent in the wide sense. In fact, we have 




so we have S 2 = det(^4) r ^4S^4 for A = vi oj e GL 2 CL). We will give one more 
example. Take S 3 , S 4 e L*( 23) given by 



Then we have 



but there exists no A e GL 2 (Z) such that S 4 = det(,4) ' A S 3 A . Indeed if there exists 
such A, then since S 3 and S 4 are positive definite, we should have det(,4) > 0. 
Then comparing diagonal components of both sides, we see that we should have 
A = ± 12 - This is a contradiction since (1,2) components do not coincide for 
A = ± 12 . We explain shortly the difference between ST 2 (Z) equivalence and the 
equivalence in the above sense. For any Si e L*(D), define 


2 In Section 304 of his book on number theory, Disquisitiones Arithmeticae [35], Gauss stated his 
speculation on this and also his reasonable insight on the relation to the size of fundamental units. 
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If S 2 is SL 2 (Z) equivalent to Si, then Si (or the corresponding SL 2 (Z) equivalence 
class, or ideal or class in the narrow sense) is called ambig. In this case, for any 
S e L*(D) such that S = ' AS\A for some A e GL 2 (Z), we see that S 
is SL 2 (Z) equivalent to Si. So there is no difference between two equivalences. 
In the case when S 2 is not SL 2 (Z) equivalent to Si, then the GL 2 ( Z) equivalence 
class which contains Si contains two different SL 2 (Z) equivalent classes. To count 
the equivalence classes of L*(D) up to the equivalence that S 2 = ' A S 1 A for 
A € GL 2 (Z), we must count ambig classes, which is a part of the genus theory 
of Gauss. Since this is complicated in general, we do not explain this theory in this 
book. (See [29]. See also Exercises 6.21 and 10.25.) 

Exercise 6.14. For a square-free integer m, show that 1 and <0 in (6.1) gives a basis 
of the maximal order D max of Q {-Jm). 

Exercise 6.15. Show that the quadratic form Q(x,y ) = x 2 + bxy + cy 2 with 
discriminant D is equivalent to the following quadratic form. 

x 2 — -j y 2 if b is even, 

x 2 + xy + y 2 if b is odd. 

Exercise 6.16. (1) Let a = Za + Z h+ ^ be a proper primitive ideal of order D f 
of discriminant D = f 2 Dk , where D k is a discriminant of a quadratic field 
K. We assume a > 0. Show that a is a principal ideal of D / (i.e. a = D/a 
for some element a e D f) if and only if there exists integers x and y such that 
ax 2 + bxy + cy 2 = ±1. 

(2) Show that the class of quadratic forms in the wide sense corresponding to a 
principal ideal a contains the one given in Exercise 6.15. 

(3) Notation being as above, give an example of a principal ideal a such that there 
exists no x, y G Z with ax 2 + bxy + cy 2 = 1. 

(4) Give an example of an equivalence class of quadratic forms Q(x,y) in the 
narrow sense corresponding to a principal ideal (in the wide sense) such that 
Q(x, y) is not SL 2 (Z) equivalent to any form given in Exercise 6.15. 

Exercise 6.17. Fix a positive discriminant D = f 2 D k, where A" is a real quadratic 
field, and consider an integral quadratic form Q(x, y) = Ax 1 + Bxy + Cy 2 whose 
discriminant is D . 

(1) Let \a\ be minimum among coefficients of x 2 of quadratic forms which are 
equivalent to Q. Show that there exists a quadratic form Qi{x,y) = ax 2 + 
bxy + y 2 equivalent to Q such that \[D — 2\a\ < b < ~/D and |a| < |c|. 

(2) Taking Q 2 as in (1), show that ac < 0 and 0 < b. (Use (\[D + b){J~D — b) = 
—4 ac.) In particular, show that 0 < b < \[T>, \H5 — b < 2\a\ < \fD + b. 
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Exercise 6.18. Show that for any integral quadratic form Q , there exists a quadratic 
form SL 2 (Z) equivalent to Q such that the coefficient of x 2 is positive. (See the 
proof of Theorem 6.7, for example.) 

Exercise 6.19. (1) Show that the following pairs of quadratic forms in (i) and 
(ii) are SLi( Z) equivalent respectively. 

(i) ax 2 + bxy + ay 2 and ax 2 — bxy + ay 2 . 

(ii) ax 2 + axy + cy 2 and ax 2 — axy + cy 2 . 


(2) Assume that D = f 2 D k < 0 and a positive definite quadratic form 
Q(x,y) = ax 2 + bxy + cy 2 has discriminant D. As shown in the proof 
of Proposition 6.11, by replacing SL 2 (Z) equivalence, we may assume that 
\b\ < a < c. Such quadratic forms are called reduced. Conversely, assume that 
there are two SL 2 (Z) equivalent positive definite quadratic forms Q , (x , y) = 
a,x 2 + bixy + Ciy 2 with discriminant D with \bj\ < a,- < c; (1 < / < 2). 
Show that (< 2 1 , b \ , c\) = (a 2 , b 2 , C 2 ) except for the following pairs. 

(i) a\ = c 1 and (a 2 .^ 2 .c 2 ) = (a\,—b\,a\). 

(ii) a\ = b\ and (1 a 2 , ^ 2 . C 2 ) = (a\,—a\,ci). 

Hint: Write down the transformation explicitly as <22 = a\a 2 +b\ccy +C\y 2 and 
assuming that «2 > a 1 and using the condition of a\, b\, c\, show that \ay\ < 1. 


Exercise 6.20. (1) Show that h(D) = 1 for D = -3, -4, -7, -8, -11, -12, 
-16, -19, -27, -28, -43, -67 and -163. 

(2) Show that h(— 20) = 2 and give a complete set of representatives of the ideal 
classes. 

(3) Show that /j ( 40) = 2 and give a complete set of representatives of the ideal 
classes. 


Exercise 6.21. Fix a discriminant D = f 2 Dx and consider S = 



L*(D). Show that the following conditions on S are equivalent (the class to which 
such an S belongs is called an ambig class). 


(1) There exists A e GL 2 ( Z) with det(A) = —1 such that ‘ ASA = S. 

(2) The matrix S is SL 2 (Z ) equivalent to 


/ a —b/2\ 
\-b/2 c )• 


(3) There exists S' 


a' b'/2 


€ L*{D) which is SL 2 (Z) equivalent to S 


\b' / 2 c' 

such that b' is divisible by a'. (Here we may take b' = 0 or // = a ' .) 


Hint: If 'ASA = S withdet(A) = — 1, then show that Tr (A) = Oand A 2 = 
Then reduce the case when A is upper triangular. 



Chapter 7 

Congruence Between Bernoulli Numbers 
and Class Numbers of Imaginary 
Quadratic Fields 


In this chapter, we prove a congruence relation between Bernoulli numbers and 
class numbers of imaginary quadratic fields (Theorem 7.1). For that purpose, we 
study in Sect. 7.2 a certain type of power series which Hurwitz 1 introduced (we call 
this “Hurwitz-integral” series). Although one may prove Theorem 7.2 without using 
this, we nevertheless introduce this notion because of its elegance and usefulness. 


7.1 Congruence Between Bernoulli Numbers and Class 
Numbers 

For a prime p satisfying p = 3 mod 4, recall that h (— p) denotes the class number 
of the imaginary quadratic field of discriminant —p. 

Theorem 7.1. Let p > 3 be a prime, p = 3 mod 4. We have the congruence 2 

h(—p) = — 2Bp+i mod p. 

2 

Remark 7.2. (1) By the theorem of Clausen and von Staudt (Theorem 3.1), the 
denominator of Bp+i is prime to p. 

(2) Using Euler’s formula £(1 — k) = — B \/ k (Theorem 5.4 (2), p. 72), we have 


1 Adolf Hurwitz (bom on March 26, 1859 in Hildesheim, Germany — died on November 18, 1919 
in Zurich, Switzerland). 

2 This is due to Augustin Louis Cauchy (bom on August 21, 1789 in Paris, France — died on May 23, 
1857 in Sceaux, France). He expressed the left-hand side by a difference of numbers of quadratic 
residues and non-residues in the interval (0, p/2). That this is equal to the class number is nothing 
but the class number fonnula of Dirichlet. 


T. Arakawa et al.. Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_7, © Springer Japan 2014 
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B P + 1 


-=j- = — 2B P +i mod p 
_ 2 


2 


and hence the theorem can be written as 



The class number formula (6.6) at the end of the last chapter gives 



This shows that the class number h(—p) is uniquely determined by its value 
mod p. Therefore, the above congruence asserts that the value £((1 — p)/2) of 
the Riemann zeta function “knows completely” the class number h(—p). 

(3) For a more advanced interpretation of this theorem using /r-adic modular forms, 
see [82]. 

(4) An analogous result in the real quadratic case is known as the following 
Anke ny 3 - Artin 4 - Ch o w 1 a 5 congruence [4] : Let p be a prime number congruent 
to 1 modulo 4, and s = (t + uj~p)l 2 be the fundamental unit. Then, we have 


u 


h(p) = Bp - 1 mod p. 


Example 7.3. Let us compute several class numbers by using the theorem. 
(1) For p = 7, 



so h(— 7) = 1. 
(2) For p = 11, 




—IB n+i = —2 B(, = —2 


so h(— 1 1) = 1. 


3 Nesmith Cornett Ankeny (bom in 1927 in Walla Walla, USA — died on August 4, 1993 in Seattle, 
USA). 

4 Emil Artin (bom on March 3, 189S in Vienna, Austria — died on December 20, 1962 in Hamburg, 
Germany). 

5 Sarvadaman D. S. Chowla (bom on October 22, 1907 in London, England — died on December 
10, 1995 in Laramie, USA). 
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(3) For p= 19, 


/ 5 \ 5 5 

-25i9+i = -2B 10 = -21 — ) = - — = - — — = 1 mod 19, 
2 \ 66 / 33 5 — 38 

so /z ( — 19)= 1. 

(4) For p = 23, 

( 691 \ 691 1 

2B 23+1 = — 2B\o = —21 ) = = - = 3 mod 23, 

-f- V 2730 / 1365 8 


so /;(— 23) = 3. 


7.2 “Hurwitz-integral” Series 


Definition 7.4. A formal power series with rational number coefficients of the form 
t” / n\ such that all c„ are integers is said to be a Hurwitz-integral series. 
The set of all Hurwitz-integral series is denoted by PL. 

Note that P(t) e PL if and only if P( 0), P'( 0), P"( 0), P ( 3 ) (0), ... are all in Z. 
Typical examples of elements of PL are e‘ and log(l + t). 

Proposition 7.5. (1) PL is an integral domain. 

d 

(2) PL is closed under term-by-term differentiation — and integration 

dt 

(3) The set of invertible elements o f PL is PL X = {P{t) e Tl\P(0) = ±1}. 

Proof. (1) We only need to show that the set PL is closed under multiplication, 
other properties being trivial to verify. For two series P(t) = YlT=o c n t" /n\, 
Q(t) = /n\, write the product P(t)Q(t ) as Yl^Lo t"/n\. The 

numbers e„ are given by 

n 

e » = E 

1=0 

and hence are integers if c , , d,,-, are all integers. 

(2) For P(t) = Y1T= o c „ t n /n\ e PL, we have 


. I Q d n - 



CXJ ffl — 1 ~~ fH 

P\t) = Y^Cn——— = 


t n 


(77-1)! ^ 

= 1 v 7 /? — 0 


ft 00 

j ' P(t)dt = Y c " 


f n + \ ^ 


t n 


(77 + 1)! 

( 1=0 v 7 (1 = 1 


— ( n I 


Both of these are clearly in PL. 
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(3) In order that P (f ) = Y2T=o c « t'' / n \ is invertible in Q[[t]], the condition Co 0 
is necessary (Proposition 1.9 on p. 15). This being so, write the reciprocal 
Pit)- 1 in Q[[?]] as t n /n - To have G PL, we need 

bo = Cq 1 g Z and thus Co = ±1. If this is satisfied, other coefficients b„ 
automatically belong to Z, because b„ is the constant term of the nth derivative 
( P(t)~' )'" > of P{t)~ l and the derivative is of the form 

/ , _n(n) polynomial in Pit). P'(t). P" (t), . . . with integer coefficients 

power of P(t ) 


and so, if Co = P( 0) = ±1, then the value of this at t = 0 is an integer since 
-P(O), P'i 0), P"( ()),•■• G Z. We therefore have P(t) G PL X . Conversely, if 
Pit) G PL X , then from P{t)~ x | r= o = c^ 1 G Z we have Co = ±1. 

□ 

Remark 7.6. The series sin(f), cos(f ) (Taylor expansions of sin(.x), cos(x) at the 
origin, viewed as formal power series in x = t) are both in PL. and cos(f) = 
1 — t 2 / 2! + r 4 /4! + ••• is invertible by proposition (3) above. Hence the series 
tan(t) = sin(f)/ cos(f) is Hurwitz-integral. This shows that the tangent numbers T„ 
in Remark 1.18 (p. 24) are integers. 

Proposition 7.7. Suppose P(t ) G PL. P( 0) = 0. Then for any N G N we have 


P(t) N 

N\ 


g PL. 


Proof. We proceed by induction on N . The case iV = 1 is trivial. Assume the 
proposition is valid up to N — 1. Using (P(t) N / N\)' = P(t) N ~ l P'(t)/(N — 1)! 
and P( 0) = 0, we have 


P(t) N 

N\ 


r 1 Pit )"- 1 
‘0 (N - 1 )! 


P'(t)dt. 


The induction hypothesis and Proposition 7.5 (1), (2) show the right-hand side 
belongs to PL. □ 

Definition 7.8. For series P(t)(jL 0 ),Q{t) G PL, we write P(0l//(2(0 or 
Q(t) =h 0 mod P{t) if the condition Qit)/P)t) G PL holds. Also we write 
Qi(t) =h Qiit) mod Pit) if Pit)\ H iQ\it) - Qiit))- 

In particular, when m is an integer (this is also an element in PL), the congruence 
Qiit) =h Qiit) mod m means, if we write Qft) = J2T=o c '^ Qiit) = 

Y2T=a c « 2> t"/n\, that the congruence cjY = cj, 2> mod m holds for every n. 
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7.3 Proof of Theorem 7.1 

Put (p — l)/2 = m. We have m > 1 because of our assumption p > 3. In the class 
number formula ((6.6) on p. 90) 

divide the sum into two parts according to m < /?/ 2 and 
= —1 because p = 3 mod 4. Then we obtain 

Ph(~P) = ~ E 

0<u<p/2 ^ * 


i p ~ l 

h(- P ) = --j2\ 


u > 


p! 2, and note 


-2 


E (E) 

<u<p/2 v F ' 


( P - u) 


(7.1) 


0<u<p/2 

2 £ + ' E (j)- 

0<u<p/2 ' 0<u<p/2 

On the other hand, if we divide the sum according to the parity of u, then we have 

= - E (f)< 2 “>- E 

0<u<p/2 y ^ ' 0<u<p/2 V " / 

E (f)-' E (f)- 

0<u<p/2 v ^ 7 0<u<p/2 ^ ' 

From this we have 




■“ E 

<U* 

times (7.1) to obtain 

H!)) 


u 


* E + r E (J). 

0<u<p/2 yy/ 0<u<p/2 yF/ 

ubtract (7.2) from two tir 


x , ( u 

ph(-p) = p 


0<u<p/2 

■ £ (i 

0<u<p/2 v ^ y 


(7.2) 


hence 


i m / 

‘""■hbSC 

'ing Euler’s criterion = 


a \ 
PJ ’ 


mod p (cf. [50, Proposition 5.1.2]) here, 

mod p. 


we 


1 m 

— 1 — 

2 - 2 m ' 
a = 1 


*(-p) = 
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Hence, the proof of the theorem h(—p) = —2B m+l mod p boils down to showing 


= — 2(2 — 2 m )B m+ \ mod p. 

a= 1 

Further, by 2 = ^ mod p and -(2 - 2 m ) = -(2 2m+1 - 2 m ) = 2 m (l - 2 m+l ) 
mod p, this reduces to showing 

m 

Ya m = 2'" ( 1 - 2 m+1 ) — mod p . 

' m + 1 


First, by ^ = 1 - j + J27=2 B n% r> we have 
1 1 2 


e‘ + 1 e' - 1 — 1 




n=2 


tn - 1 


1 V-S 

= - + y:a-2")«, ( — 


n=2 

oo 


- + V(l-2" +1 )^^--— . 

2 ^ 71 + 1 77! 

H=1 


Replacing t 2t and multiplying both sides by 2, we obtain 

D fll 

n + l/i oj + K+M ‘ 


e 2r + 


- = l + V? +1 (l-2" +1 )^- 

1 ^L, 77 + l7t! 


(7.3) 


Now since 
2 


e 2r + 


1 /l-e 2r \ /l-e 2r \ / 1 — e 2i \ 

t = TTe,i = ‘ + (— ) + (— ) + (— ) + 


and ^ 1 f j 2" 1 ^ e "H, we have f2 2 + 1 e A. Also, by Proposition 7.7, 

/ 1 _ 2r \ / | 2/ \ tv 

we have I 1 =# 0 mod A!. In particular, if A > 2, we have I l-y— 1 =// 

0 mod 2. By this, combined with the fact that the coefficient of ^ in 1 -T' is 
divisible by 2 if 7? > 2, we have 


e 2r + 1 


= h 1 + t mod 2 . 


(7.4) 
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Therefore, by (7.3), we have 


2»(i _ 2 "+ 1 )^±i € z 
n + 1 


if n > 2. Next observe 


m 


m 


2 / 77+1 


(1 + e 2t )J2 e4at = (**" + e (4a+2) ') = ^ 

a = 0 a = 0 7=0 

P 00 ( 7it \n OO p 

= EE + = EEw»^ 

7 =0/7=0 7i=0 v=0 7 

°° / P \t n 

= r + i + E 2 "(E/+ 

n=l v /=l 7 ' 


r" 


=« 1- Z 2" — - mod p. 

' 77! 


p-i|« 


(Here we have used (3.1) on p. 42, i.e., j n = —1 mod if p — l|/7 and 

^2j = | j n = 0 mod /? otherwise.) Multiplying both sides by gl , 2 + 1 (e 7f), we have 


2 Z e4a ' 

o=0 


= 77 


+ 


t(‘ + +) 


p— 1|« 


mod p. 


Comparing the coefficients of on both sides, we obtain by (7.3) (note m = < 

/'- 1) 


B 

2 V(4r/) m = 2 m+1 (l — 2 m+1 ) — mod p. 

' 777+I 

a=0 

Dividing both sides by 2 and using 4'" = 2 P_1 = 1 mod p, we have 
V a" = 2'" ( 1 - 2 m+1 )-^±l mod p. 

' 777 +1 


This is what we wanted to show. □ 

Exercise 7.9. Compute the class number /? ( — 3 1) by using Theorem 7.1 and 
Table 1.1 in Chap. 1. 
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Exercise 7.10. Let p be a prime congruent to 3 modulo 4. Prove that the numerator 

of B p+i can never be divisible by p. 

2 

Exercise 7.11. The Euler number E n is defined by the generating series 

7 00 t n 

t =X' E J-. 

e‘ + e 1 ' " n ! 

77=0 

Prove that all E„ are integers. Moreover, prove that E2,, is odd for all n > 1. (£ 2 / 1+1 
is easily seen to be 0.) Hint: Use (7.4). 


Chapter 8 

Character Sums and Bernoulli Numbers 


We would like to explain arithmetic identities between Bernoulli numbers and the 
root of unity. Namely we shall explain certain small collections of formulas between 
exponential sums or character sums and Bernoulli numbers. We often encounter 
such formulas when we compare the dimension formulas of modular forms obtained 
by the Riemann-Roch theorem and by the trace formula. Often, the exponential 
sums appear in the first method and the Bernoulli numbers appear in the second 
method. But sometimes it is not easy to prove these relations in an elementary way, 
and there are several cases that no elementary proofs are known. It seems that there 
is no general elementary way to study these relations. Our elementary method in 
this section is very restrictive and clearly not enough to attack difficult problems, 
but still, it should give us some general feeling of what are easy exponential sums. 

Before going to easy things treated in this section, we give here one example for 
which no elementary proof is known. 

Let p be a prime such that p = 3 mod 4. We put £ = e 27TI,,p . Then the following 
formula holds. 


E 

( a,b,c)GS 


\j/(abc) 

(l-na-w-n 



+ — B^f 
6 


Here we put S = |(a, b, c) e (F*) 3 ; ab + be + ca = Oj , F p is the prime field 
Z/ pZi with characteristic p, \[r is the quadratic residue character defined by ij/(x) = 
and B n f is the generalized Bernoulli number. 

Motivated by dimension formulas, this relation is conjectured in [69], and later 
proved in [47], The proof there is a remote indirect proof. It is natural to ask if there 
exists a more elementary and direct proof. But no elementary alternative proof is 
currently known. Assuming that such a direct proof exists, we have no idea if such 
a proof is simple or complicated before we really prove it. 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_8, © Springer Japan 2014 
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The results we shall give below are much easier compared with this example. 
But we see that even researchers sometimes fail to recognize that these are easy, 
so we think it is of some use to explain such elementary formulas here. To fix our 
standpoint, we assume here that we are satisfied if an exponential sum or a character 
sum can be written by the generalized Bernoulli numbers. 


8.1 Simplest Examples 

We start from simple examples which can be treated easily. For a prime p , we put 
= e 2n 'l p . As one of our examples, we consider the sum J^_|(l — £“) _1 . In fact 
this sum is equal to 


E 


a= 1 


l 


p-i 

2 


(8.1) 


Similar sums obtained by attaching the quadratic residue symbols or character 
values of a to the numerator of 1/(1 — £“) can be expressed by using Bernoulli 
numbers and Gaussian sums. We call loosely this kind of sum obtained by 
combination of roots of unity and characters an exponential sum or a character 
sum. As for the first example (8.1) we gave above, the argument is very simple. 
For similar sums with some characters too, this kind of formula can be obtained by 
fairly easy calculation. 

Proof 1. We give the first proof. Since (1 < a < p — 1) are mutually different 
p — 1 roots of the equation X p — 1 = 0 which is not equal to 1, we have 


X p - 1 


p - 1 


- = x p ~ l + x p ~ 2 + ■ ■ ■ + 1 = Y\(X - p. 


a= 1 


Take the logarithm of both sides and compare their derivatives with respect to X. 
Then the left-hand side is 


-^-\og(X p - 1 +X p ~ 2 + ---+l) 


E P a Z\aX° 

X p -i + 


dX 

and the right-hand side is 


d 

~dX 


E' 0 g(*-p = E 


< 3=1 


<3= 1 


1 


Here if we put X = 1 , then we get the relation to be proved since p 1 'YJa= 1 a = 
(P~ l)/2. ~ ' 


□ 


8. 1 Simplest Examples 
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Proof 2. The next proof might appear in an exercise of the elementary algebra. 
Here we use a little tool of algebra (Galois theory of cyclotomic fields). We never 
use this argument later, so if the readers are not familiar with this tool, they are 
recommended to skip this and to go directly to Proof 3. We call the field generated by 
the nth roots of unity over the rational number field a cyclotomic field. Gauss studied 
cyclotomic fields deeply [35, Chapter 7]. The cyclotomic field Q(f p ) is a Galois 
extension of Q, and the set of the mappings o a : t, p m>- for integers a with 1 < 
a < p — 1 gives the Galois group of Q(£ ? ) over Q. So the sum which appears in (8.1) 
is equal to the trace of x = 1/(1 — £ p ) e Q(£ p ) overQ, that is, the sum a a (x), 
where o a (x) is the action of the element ct a in the Galois group on x. This trace is 
invariant by each element o a of the Galois group, so by the general Galois theory, 
we see this is a rational number. The degree of extension of Q(£ p ) over Q (namely 
the dimension as a vector space over Q) is p— 1, but since we have t, p = I — I / x, x 
also generates Q(£ p ), so the minimal polynomial of x over Q is also of degree p — 1 . 

The minimal polynomial of t, p is 1 + X + X 2 H b X p ~ l = (X p — \ )/(X — 1), 

so the minimal polynomial of —\/x = t, p — \ is given by ((X + l) p — \)/X . If 
we put X = — 1/7, then this is written as — Y l ~ p ((Y — 1 ) p — Y p ), so the minimal 
polynomial of x whose coefficient of the highest degree is one is given by 


(Y p - (7 - 1)0- 


P 


The trace of x is — 1 times the coefficient of Y p 2 , so by the binomial theorem, 
we get 


1 r P(P-1) _P~ l 


— x 


P 


Hence we prove (8.1). 


□ 


Proof 3. Proof 2 is somewhat clumsy and similar methods cannot be used so easily 
in other cases as imagined. Here we explain a more general method. Let t be 
an indeterminate (i.e. a variable, or an element transcendental over the field in 
question), and consider the rational function 


1 


1 ~t.pt' 

By expanding this as a formal power series along t = 0, we have 
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If |*| < 1, this calculation is valid also as a convergent power series. If we take here 
t —y 1, then by the definition of differentiation (or the theorem of l’Hopital 1 ), we 
easily see that 


1 


1 

p 




In the above calculation, we used only the properties that %p = 1 and ^ 1 . so 
for any a prime to p, we can apply the same calculation to £“ instead of £ p , so we 
get also 


1 

1 - 


1 P ~ X 

--£<■ 


Here if we take the summation over a , we have 


£5' 


p - 1 


£tr-i = 


a = 0 


-l 

p- 


if c ^ 0 mod p, 
if c = 0 mod p. 


so we get 


E 


i 


i 

p 


£(-o 


c=l 


1 

— X 

P 


P(P~D 

2 


P-1 

2 


Hence we prove (8.1). □ 

An advantage of Proof 3 is that we can calculate mechanically without any 
special insight on the result beforehand. 

An alternative proof for the expression for ( 1 — £ a ) _1 that we obtained in Proof 3 
above will be given also in Lemma 8.5. By the way, we see of course without any 
calculation that the element (1 — £ a ) _1 of Q(X P ) is a linear combination of the 
basis £p 1 of Q(^ p ) over Q. In principle, for any element of a finite field 

extension K over k, a concrete expression of the inverse by a basis can be obtained 
by Euclidean algorithm on polynomials. But we cannot expect in general that such 
a linear combination is written by a simple formula. Since we have an accidentally 
easy expression in the above case, we could use it. 


'Guillaume Franfois Antoine de l'FIopital, (born in 1661 in Paris, France — died on February 2, 
1704 in Paris, France). 
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8.2 Gaussian Sum 

Before we start explaining the relation between exponential sums and the general- 
ized Bernoulli numbers, we explain about Gaussian sums. The simplest definition 
of a Gaussian sum is given by 


p - 1 




for an odd prime p. This number was calculated by Gauss and the result is very 
beautiful. Namely we have 



Here the choice of the square root is given as follows. If p = 1 mod 4, we take J~p 
as a positive number and if p = 3 mod 4 we take J—p so that its argument is tt/2. 
As we shall see later, it is very easy to prove that g(p ) 2 = (—l)( p ~ 1> / 2 p, but it is 
considerably difficult to determine the sign of the square root in g( p). It is said that 
Gauss spent 4 years getting the proof. 2 

At the present time, there are many known ways to determine this sign, but it 
would be beyond the scope or different from the aim of this book to carry those 
proofs here. So we would like to ask readers to refer to some other books on the 
determination of the sign. (For example, many proofs are introduced in [15].) 

Now, this Gaussian sum can be also expressed in a slightly different way. Let 
i /r be the quadratic residue symbol modulo p defined on p. 88 (i.e. = yj'j) 

and put 


2 Gauss wrote in his letter ( Collected Works X, p. 24) to Olbers dated September 3, 1805: “Since 
four years ago, a week has seldom passed when I had not made one or another attempt to solve this 
difficulty, especially lively now also again in the latest period. But all pondering, all search have 
been in vain, and each time sadly I must have laid down my pen again. At last a couple of days 
ago it succeeded, not for my wearisome search, but only through the grace of God, I would like 
to say. As the lightning strikes, the problem has been solved; • • • Strangely enough, the solution 
of this problem appears now easier than many others which did not well keep me so many days as 
these years, and certainly no one will, when I give a lecture on this matter some day, get suspicious 
of the long dilemma in which it brought me.” Incidentally, Olbers (Heinrich Wilhelm Matthias 
Olbers, 1758-1840) was a medical doctor and an astronomer. The biggest asteroid, Ceres, which 
made Gauss leap to fame, was discovered by Italian astronomer Giuseppe Piazzi (1746-1826) on 
New Year’s day in 1801, but became lost through the difficulty of observation, then was accurately 
located by Gauss by calculation. It was rediscovered by Olbers on New Year’s day in 1802. Also, 
he newly discovered the asteroid Pallas in 1802 and Vesta in 1807. Olbers was a friend of Gauss 
whom he visited most often. 
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P~ 1 

gif) = Z^”^' 

n= 0 

For each n, the number of representatives mod p of x e Z such that n = x 1 mod p 
is given by 1 + \[r («), so we get 


p — i p — i p — i 

soo = Z (1 + ^(«))^ = Z + Z fi^l- 

77 =0 77=0 77=0 

Flere the first sum is obviously 0, so this coincides with g ( i// ) . 

Here we shall explain more general Gaussian sums which will be needed later. 
If we change x 2 to a quadratic form of several variables and integral vectors in 
the above definition of Gaussian sum g(p), we can define the “Gaussian sum of 
quadratic forms”. This plays an important role in the transformation formula of 
theta functions for example, and it is possible to calculate this as the above formula 
of Gauss, though we do not go into this direction here. On the other hand, if we 
change \[/ into a general Dirichlet character in the definition of g if), then we can 
define a Gaussian sum associated with a character. We treat this in this book. 

Definition 8.1. Let x be a primitive Dirichlet character with conductor / and put 
£/ = e 2n 'lf . We write 


/-i 

g(x) = Z *(”)£/ 


77=0 


and we call this the Gaussian sum associated with character /. 

In such a general case too, the absolute value of g(x ) is a simple quantity and 
given by gix)g(x) = f ■ (The proof will be given later.) But for general /■ there is 
no formula of the Gaussian sum itself which is simple and explicit. When / 2 = 1, 
there are some formulas which will be explained later. 

Lemma 8.2. Let x be a primitive Dirichlet character with conductor f and let a 
be an arbitrary integer. Then the following formula holds. 

f 

*(«)*(*) = Z *(">£/ • 

77=1 


In particular, we have g(x) = x (~ !)<?(/)• 

A good point of this relation is that we are not assuming that a is coprime to / 
and this is often useful for calculation of exponential sums. 



8.2 Gaussian Sum 
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Proofs of Lemma. If a is coprime to /, the relation is obtained trivially by replacing 
n by an in the debnition of the Gaussian sum g(x)- If a is not coprime to /, then the 
left-hand side is of course 0 by debnition of a Dirichlet character, but the right-hand 
side is also 0 by the following reasoning. If (a, /) = d > 1, then 'Cff depends only 
on n mod f / d , but elements 1 < n < f such that 1 mod f / d form a subgroup in 
(Z//Z) x and so for each bxed « ( > coprime to /, we have 

X(n) = X( n o) X(») = °- 

n=no mod f /d n= 1 mod f /d 


The second relation is obtained by putting a = — 1 and taking the conjugate of both 
sides in the brst relation. Namely we have /( — 1 )g(x) = g(X) and since /(— I ) 2 = 
X(l) = 1. we have /(-l) -1 = *(-l). □ 

Since the Gaussian sum is never 0 as we see below, the above formula can be 
written also as 


/ 

*(fl) = sGD -1 E *(«)*/"■ (8 - 2) 

n = 1 


Lemma 8.3. We have 


g(x)g(x) = f 

Proof. By Lemma 8.2, for any integer a, we have 


/-i 

X(a)x(a)g(x)g(x) = X(n)x(m)^f"~ m) . 

n,m=l 

Summing up both sides from a = 0,...,/ — 1, the left-hand side becomes 
( P(f)g(x)g(x) since x( a )x( a ) = 1 if ( a > /) = 1 and 0 otherwise. Here (p(f) is 
the Euler function, namely the number of positive integers not more than / which 
are coprime to /. By using 


/-i 

E c ,a(n—m) 

v 

a = 0 


jo if n m , 
[ / if n = m. 


the right-hand side becomes 


/-i 

/ x J2 X(n)x(n) = f<p(f). 

n = 1 


Comparing both sides, we prove the lemma. 


□ 
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Next we will give a formula for the Gaussian sum for a primitive Dirichlet 
character with j 1 = 1 . 

Proposition 8.4. Let / be a non-trivial primitive Dirichlet character such that 
X 2 = 1. Then there exists a quadratic field K such that /(m) = Xk(u) = 
where Dk is the fundamental discriminant of K. Furthermore, for such x , we have 

g(XK ) = sTDk- 


By using the Chinese remainder theorem, we decompose the sum in the definition 
of g(XK ) into powers of primes and we reduce the formula to those in the case of 
primes. In the middle of the proof, we need the reciprocity law of quadratic residues. 
We omit the details here. We refer to [15]. See also Exercises 8.26 and 8.27. 


8.3 Exponential Sums and Generalized Bernoulli Numbers 


In this section, we consider sums similar to those in Sect. 8.1, attaching characters to 
them. In fact, these new sums are far more interesting. In the previous sections, we 
took only the prime power root of unity, but in this section we consider the / th root 
t,f = e 2n '!f of unity. Since it is troublesome to write the suffix / always, we fix a 
natural number / once and for all and we denote simply by f We will explain 
below how to obtain formulas for sums like those in the last section by generalized 
Bernoulli numbers and simple combinatorial numbers. 

First of all, when a is not divisible by /, we get the formula given below. (This 
formula will be used also in Chap. 11, Sect. 1 1.2.) 

Lemma 8.5. If f \a, then we have 


1 


1 

7 


/-i 


c=\ 


Proof. This is obtained by the same calculation as in Proof 3 in Sect. 8.1, but here 
we give more direct proof. By expanding polynomials, we have 


/-i 

(1 -X) x J2 cXC 

C— 1 


/-i 


-(/ - \)Xf + £(r - (c - l))X c 


C= 1 


-fxf + ^> c . 


c= 1 
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If we substitute as X = in the above, then the second term of the right-hand side 
is zero since J2c=i c = £“(1 ~ £“-0/(1 — £“) = 0, and we have 


/-i 

(i -nJ2 c s ac = -fs af = -f 

C= 1 

Dividing both sides by — /(I — £“), we have the equality in the lemma. □ 

Now we consider a Dirichlet character / whose conductor is / (i.e. a primitive 
Dirichlet character modulo /). First, we put 

n , , X ( a ) 

p klx) = L (l-raf 

and try to describe this. Of course if / = 1 , then the denominator becomes zero so 
this has no meaning. So we assume that / > 1 from now on. As in the last section, 
we define the Gaussian sum with respect to / by 

/ 

ga) = £z(«)r 

a= 1 


To treat the exponential sums slightly more generally, we put 


Pk,x(t) 


y- X(a) 

^ (1 — t a e‘) k ' 

a = 1 


We have Pk,x( 0) = Pk(x )• The next relation holds. 

^i.z(O) = (8.3) 

Proof. By the last lemma and Lemma 8.2, we have 


i / , f 

^l.z(O) = -jJ2 ct > aC = t Six) *( c ) c = -Six)B\,j. 

J 0 = 1 C = 1 ’ C=1 


In the last equality, we used the formula (4.1) for B\ x in Sect. 4.2 (p. 54). □ 

In fact. Pi x (t) is almost identical to the definition of the generalized Bernoulli 
numbers. Indeed, calculating the development, we have 


Pi.xit) 


a= 1 7=0 


“J oP 


g(x) Y. xU)i 


T,j=oxU) e - 

ef‘ - 1 


Jt 


-g(x) 


jt 
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(Here we are regarding e l as a formal variable and expanding it as a formal power 
series. Or we may say we are expanding it in the neighborhood of t = —oo. 
Anyway, in the final stage, it is an equality between rational functions of e‘ , and the 
calculation is justified regardless of any interpretation.) Since we are taking / > 1, 
we have /(O) = /(/) = 0. So by definition, we have 

°° jtl — 1 

Pijt) = -g(x)J2 B ”x-ir- 

n = 1 


Hence we have 

^ i( 0,= <;,0) = - s O)A T *, +1 , I . (8.4, 

Next we want to give a formula for Pk, x ( 0). Now we put 

1 

Qk,av) - (J _ £a e ryt • 

Then we have 


/ 

p k.x(t) = j]x(a)S/t,a(0. 

a= 1 


and 

~T~ Qk,a(t) = k(Qk+l,a(t) ~ Qk,a(t )), 
at 

so inductively Qk,a(t) is expressed by higher- order derivatives of Q\, a (t), so Pk,/(J) 
is also expressed by derivatives of higher order of P\, x (t). More explicitly, by 
using the Stirling number of the first kind (as for the definition, see Sect. 2.1), it 
is expressed as follows. 

Proposition 8.6. 


Qk,a(t) 


Pk, X (t ) 


(k- 1)! 
1 

(k — 1)! 


k—\ 

E 

k 

J + 1. 

gS( o, 

1=0 



k - 1 



E 

1=0 

K 

_ l + 1 _ 

pS( { ). 


Proof This is proved by induction with respect to k. The assertion is true for 
k = 1. We assume it is true up to k. Since Qk+\,a(t) = Qk,a{t) + j: Qk,a(t), 
we can express Qk+i,a(t) by the linear combination of derivatives of Qk,\(t) by the 
inductive assumption, and the coefficient of Q^lft) in Qk+\,aU) in this expression 
is given by 
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1 


k 

l + 1 


1 

+ k 


forO < l < k. By the recurrence relation of the Stirling number of the first kind 
(p. 28, Eq. (2.2)), this is equal to 


1 

jfc! 


k + 1 
/ + 1 


Hence we prove the assertion. □ 

Corollary 8.7. 


PkJ 0 ) 


X(a) 

(i - n k 


g(x) r k 

( k - w + 


B i+hx 

/ + 1 


(8.5) 


Proof. In the second formula in the proposition, put t = 0 and then use (8.4). □ 

Example 8.8. For example, we have 


v ~~' 1 X(a) 

d - « 


= ~g(x)Bi.j, 


Y' xfo) 
“ (i-C a ) 2 

X(a) 


-£<X) , 

_ £<X) • 


In the above, we assumed that the character / is primitive in order to evaluate 
(0) by Bernoulli numbers in the final stage, but we did not use this condition at 
any other place. So, in the above, even if we take 1 instead of /(a) for any a, we 
can execute almost the same calculation. For example, we get 


/-i /-i 

Ee."«) = E 


a= 1 


a=\ 


l 

1 - ^e 1 


f L_ 

l - e f‘ 1 -e r 


= /- 


fe ft 
eft - 1 



= /- 1 + E 5 ' 

n= 1 


(i -/"V "- 1 

n\ 
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Hence if we put t = 0 here, we have 


E 


a= 1 


l 


/-i 

2 


since B\ = 1/2. The case / = p is the formula in Sect. 8.1. 

We give similar formulas in more general cases. Noting the relation 


/_1 1 

E fi?>) = 0 - / /+1 )^/+1 + Sio(f - 1) 


( 8 . 6 ) 


(where 8/o is Kronecker’s delta, that is, 1 for 1 = 0 and 0 otherwise), we get the 
following formula. 

Lemma 8.9. 


/-i . /-i 

E = E e^(o> 

a=l V S ' a= 1 


A:— 1 


= 1 E 


k 

l + 1 


(1 -/ , + 1 )fi / + i + (/- 1)5/0 }. 


/ + 1 


We give examples of this formula. 
Example 8.10. 


E 


a= 1 


l 


E 


a= 1 


l 

(i - ! a ) 2 


E 


a= 1 


l 


E 


a= 1 


l 

(i - ! a ) 4 


/-i 

2 ’ 

(/-!)(/ -5) 

12 

(/-!)(/ -3) 

8 

(/-D(/ 3 + / 2 — 109/ + 251) 
720 


Using similar methods, we will give formulas for the following sums: 


/-i 

E^ r 

6=0 


8.3 Exponential Sums and Generalized Bernoulli Numbers 


115 


When / divides a, then we have 'C, ar = I , so this should give the formula to describe 
sums of powers. Also under the assumption that / does not divide a , we give 
formulas to express these sums by Qk, u (0) . This is a generalization of the formula to 
express the right-hand side of Lemma 8.5 by the left-hand side. (These formulas will 
be applied to calculations of exponential sums later.) As usual, for natural numbers 
k, j , we denote by ( k ) the binomial coefficient 


( k \ = - 1) • • • (k - j + 1) 

w ” j] 

Here for j = 0, we put (^) = 1 . 

Lemma 8.11. (1) When a is divisible by f , we have 


f 

c = 0 



c = 0 



f k ~ i+l 

k-j + 1 ' 


(2) When a is not divisible by f , we have 

f k / k—m + 1 

^ c fc r= ^ ( _ i r(m _ i )!ema (o) £ (-1 f-Jfi 

c = 0 m = 1 \ 7 = 1 

The first formula is nothing but the formula (1.1) in Chap. 1. 

Proof. First we assume that a is an arbitrary integer. To extract c k by differentiating 
an easily handled function, we consider the following transformation, taking t as a 
variable. 



/_1 d k ( f \ 

E c *t' v ' = jf 

c=0 yc=0 J 

d k f 1 - e ft \ 

~ dt k VI - Z a e‘ J 

= Qu<0 - E {^f k ~ left Qya(t)- (8.7) 

To show the last equality, we used the Leibniz rule 3 on the higher-order derivatives 
of products. 


3 Gottfried Wilhelm Freiherr von Leibniz (bom on July 1, 1646 in Leipzig, Saxony (now 
Germany) — died on November 14, 1716 in Hannover, Hanover (now Germany)). 
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If / \a, then we have 




Q 1,0 (0 = r 

e' — 1 



OO 


i-E 

n = 0 


?±t *- 1 
n\ 


This has a pole at t = 0. So comparing the constant terms of both sides of (8.7) as 
a power series expansion with respect to t and noting that 


eS(o = - 7 + a~5i)-yf~-- 


B , 


e!2e> = 

" f n t n 


B 


7 + 1 


B 


7+2 


7 + 1 (7 + 2) 


t 


(7 > 1). 


Jt 


= E 


we get 


/-1 

E^ 

c=o 




f j+i 


(7 + 1)! 


x(-l) J 'y!-/* 


*— 1 


n ^ iiti + EE _ 


7=0 
Jfc-1 / 




7 + 1 A: + 1 

,*7 + l , / i+l 


k-j till 


- + 


7=0 V 
k 


E / 


7+1 . 


k - j k + 1 
B, 


~f k 




fk + 1 


k — y+l & 1 

(In the second equality, we used the fact l)- 7 0) j+y = j+_.) So we get 


++. 0 - 


/ 


ir-7+1 


k-j + 1 


In this way, we obtained the formula for the sum of powers again. 

(n) 

Next, we assume that a is not divisible by /. Here we want to express Q x a {t) 
by Qk.ait). This is a converse of the formula in Proposition 8.6 and this time, it is 
described by using the Stirling number of the second kind. (As for definition, see 
Sect. 2.1.) 
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In the formula (5. 1) of Proposition 2.6 p. 28, if we substitute as m — > m + 1 , n 
k + 1, we get 


k + 1 

8 m+hk+ 1 (= 8 m , k ) = (~l) k+1 (- 1 )' 

/ =m + 1 

By virtue of this equality and Proposition 8.6, we have 

k 


k + n 

' / ' 

i 1 

_m + 1 


eg(o = E weffco 


m=0 

k 


k + 1 


= E*- 1 )** 1 E (- 1 )' 


w=0 


l=m + 1 


P + n 

■ / ' 

i J 

77! + 1 


e^(o 


(fc+1 

= E (-!) t+,+1 

1 = 1 


' + 1 ] E r 

* m=0 L 


1 


l 


k-\~ 1 t . i \ 

= E(- 1 ) t+,+ 1 (/ -!)!■ ; 

/ = 1 ^ 


777 + 1 


effto 


So putting t = 0 in (8.7), we have 


/-i 


Ar+l 


E c*£ ae = J2(~ l ) k+m+1 ( m - !) ! 


c = 0 


m=\ 


k + 1 1 

m j 


2m, a (0) 


*-/ + ! 


+ E . /' E (-l)^ +m (m-l)! 


7=o 


k - j + 1 

m 


QmAO) 


k-j + l 


= E E - 1)! 


ri V 


k — j + 1 
m 


fk-m+l 


= ^(-l) m (rn - l)!fi mifl (0) [ E 

i=i 


m=l 


Qm,a(0). 

k-j + l' 
m 


By the above calculation, we get the following examples. 
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Example 8.12. 




_ — / 2 + 2 / 2 / 





— / 3 + 3/ 2 — 3/ — 3/ 2 + 9/ -6/ 


W fl (l-£ a ) 2 (W a ) 3 ' 


Application of these formulas will be given in the next section. 

8.4 Various Examples of Sums 

In this section, we treat several sporadic examples. Sometimes we can use the 
similar method for sums on character values as for sums on roots of unity. We 
shall see this. Let x be a primitive Dirichlet character modulo /. We consider the 
following sum. 


/-i 

Skix) = E x( a i V a k )ci\---a k . 


Here we should note that the sum depends on the range of natural numbers a,- 
(namely, if we change them to other representatives mod /, then the value of the 
sum would change). Even for k = 2, it is not an efficient method to calculate this 
kind of sum by dividing the sum into various pieces and to seek the answers by 
case-by-case analysis. First of all, let us recall the following relation (cf. (8.2)). 


/ 


X(a) = g(ir l J2xW an - 


By virtue of this relation, we get 


/-i / 


Sk(x) = gffl~ 1 E J2^' ,(ai+ ' +ak)a i--- a k. (8.8) 


ai,...,ak=0 n = 1 


But since we have 
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we obtain 


Sk(x) = (~f) k g(x) 1 E n X ^Lk 

n= 1 ' S ' 


= (-ffg(xr l Pk,i( o) 

fk k—1 


(k — 1 )! 


V — 

h l + l 


k 

/ + 1 


B i+\,x- 


For example, we have 


Si(x) = ~ f 2 (*i., + ^2,*) , 

5 3(x) = / 3 + -5 2 , x + • 

We also consider the following similar examples. 


/-i 

S(x.e i,...,e k )= ^2 X( a i d b flilai 1 ••■af. 

ai,...,at = l 

The formula to describe this by Bernoulli numbers can be obtained by almost the 
same method as before. Indeed, as before, writing the character values by Gaussian 
sum and roots of unity, we have 


/ k //- 1 

S(x,e u ...,e k ) = g(xr l ^2x(n) ]~~[ E ^ ? 

n = 1 y = l \ c= 1 


/ 


E (-D ei+ '' 


+eic~ h lk+m\-\ \-mk y/H W* 


n = 1 


1 <mj <e j — lj + 1 
1 <7 <k 


l C i\ l €k\ 

x L ) '■' ( / A ) 6mi+-+m*,n(0)(mi - !)!••• (/Jlfc - 1)! 


S\ — l\ + 1 

m i 


E (-1) 


— 4 + 1 

>11 k 

— He*— h — 



1 </My <ej — Zy + 1 

!</<£ 
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Wll 1 hm/fc 

x 7 T T 777 ( m 1 - 1)! • • • (rn k - 1)! 

(m i H b m k - 1)! 

\e\ — /i + ll f — l k + 1 1 


miH hm/f-1 , r , -i 

E l m\ H hfflt 

7 TT L / + i J B,+l 


Below, we sum up the above formula again. 

Proposition 8.13. We have the following formula. 

e\ n k / 

S(j, ci e*) = (_l)«+“+«+i £ • • • £ (-/) /l+ - +/ * fl z 


h = l h = 1 ;=l 

g l ~~h~ H e k ~h 1 / i\miH k 

mi = l mf = 1 y=l 


^ / — / / + 1 


miH t-ra& — 1 


■-T'lllc — i . 

E rn 

/=o L 


W| b b m k 

/ + 1 


Example 8.14. We give examples of applications of the above lemma below. 


Sk(x) = s (*,i 1) 


(-i) k+1 / k 44 i r k 
(k- D! ^4TT|_/ + i 


./-l e-l r e -l ( 

s Or, e) = X /W"* = X IT T / 

71=0 ;=o V 


/ “ 1 / 

5(X,2,1)= X X(w + «)^ 2 n = -/ 2 ( fB hx + 

vn „=n V 


y^-B 2 , x + ±B 3 , x 


We also give another similar example. Let m be a natural number which is 
coprime to a fixed / and smaller than /. We consider the following sum. 

/ 

An Or) = X* (fl + m A- 

<3= 1 

Then we have 

/ 

And) = g(x )- 1 X am^ a+myn 

a,n= 1 
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= «<»-'( £ mi kr^ + /)|. 

\n = l. («,/)=! ' S ' ■ ) 

but since we have 

/ m — 1 / m — 1 

E /(«)(i - r B )(i - rr 1 = E E /(»)?"' = f(/)E^)- 

n = l,(n,f)=l 1=0 n = l,(n,f) = l 1= 1 

taking the relation ^,{ =1 /(«)(! — £") -1 = — i n the last section into 
account, we get 


DM) = f 


T,X(D + B hx 


\l = t 


We will give several similar examples in the exercise of this chapter. 


8.5 Sporadic Examples: Using Functions 


Lemma 8.15. Let f > 1 be an odd number and let x be a primitive character with 
conductor f . Then we have 

y^ X(a) 

a - wa - w 

= -g(x) (y 4 B ^x + + xV)) B ix + ^(3 + 5W))Bv / ) ■ 

Proof. If we put 


Qk.xiO 


E 


a= 1 


X(q)(l + fV) 

(1 - 'C, la e ll ) k ’ 


Pk, x (t) 


y^ X(a) 

(1 _ fa e t)k’ 


then we get 


00 f n - 1 

Q i .*(0 = ^i ,*(0 = -g(x) E 

71 = 1 
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Since 

= 2kQk+ + (1 k)Qk,x(t) ~ x(2)Pk,x(2t). 

we can write <2 a,/( 0) inductively by Pk x (0), and then by Bernoulli numbers. For 
example, if we put k = 1 in the above relation and take the derivatives, we get 

01, z (O = 2Q 2 ,x(t) ~ Q hx (t)-J(2)P hx (2t), 

02, *(0 = ^(0i>) + Q[, x (t)) + W)Pix(2t), 

and 

02,* (0) = Q B lrx + (1 + , 

Qi, x (0) = (^3,z + Q + K2)) fh.^J . 

In the same way we get 

02,*(O = 403,*(O - 3 Q2.x(n - W)Pl.x(2t) 

and 


03,2 (0) = ^(02,,(O) + 302,2(0) + X(2)P 2 , x m. 

Since the left-hand side of the lemma is equal to Q 2tX (0), substituting this for the 
above value and using the relation P 2x (0) = —g(x)(B i, x + B 2 j/2), we get the 
right-hand side. □ 


8.6 Sporadic Examples: Using the Symmetry 

In the examples below, we denote by i fr(n) = (-) the quadratic residue character 
where p is an odd prime. 
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Example 8.16. We put 


p - 1 

7 = ^ t //(a)a. 

< 3=1 

By definition, this is equal to pB\ ,i, and we know already that B\,j, = 0 if 
\jr{— 1) = 1 (i.e. if p = 1 mod 4 by the quadratic reciprocity law). This is also 
shown directly as follows. We assume that p = 1 mod 4. Substituting p — a for a 
and noting \j/(a) = = \js(p — a), we get 


p - i p - i 

i = ~ a ^p ~ a) = X! - a). 

a= 1 <3= 1 


Hence adding this to the original expression, we get 


p - 1 p - 1 

21 = ^2 f' (a)(a + P ~ a ) = /> ^ i/ f (a) = 0. 

0=1 a=l 


Of course this argument cannot be used when p = 3 mod 4. Actually, if p = 
3 mod 4 and p > 3, then —7/ p is the class number of the imaginary quadratic field 
Q («J—p), so it is a more complicated value. 

Now there are many cases where the above kind of substitution is effective in our 
calculation. For example: 

Example 8.17. The character i jr being as above, put 


p - 1 

7 = \jf{l)lmn. 

l,m,n=\Aln=m 2 mod p 


Then substituting m — > p — m , we get 


p - 1 

27= ^2 ifr(l)ln(m + p — m) 

l ,m ,n = \ ,Aln—m 2 mod p 
P 1 

= p ^2 + 1 )n 

l,n = 1 

p-i 

= p ^2 + \//(l)ln) 

l,n = 1 

= p\p-l)B hx/f . 
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So we have 


T _P\P~ D„ 


Example 8.18. The next sum was first considered in connection with dimension 
formulas of automorphic forms [92]. We put t//(n) = f where p is a prime and 

S = {(s,t,r)eF 3 p - s(r + s)(s + t) ± 0}. 

Put £ = e 2lz, / p and define a sum I by 


'= E 


i fr(s 2 — rt ) 


(s,t,r)€S 


(r +j -i)(^ +; -i)(r j -i)' 


At first glance, it might seem difficult to calculate this sum in an elementary way, 
but actually this can be easily calculated as follows. If we put a = r + s, b = s + t, 
c = — s , then s 2 — rt = c 2 — (a + c)(b + c) = —ab — be — ca. Hence if we put 
T = F* 3 , then substitute —a, —b, —c for a , b, c, we get 


'= E 


(a.b.c)eT 


= E 


(a.b.c)ST 


E 

(a.b.c)eT 


\[f(—ab — be — ca) 

(£ fl -i)tf 4 -i)(£ f -i) 

i jr(—ab — be — ca) 

(r°- i)(r 6 -i)(r c -u 

xl/(—ab — be — ea)^ a ^ b ^ c 

a - na - to - ? c ) 


E 

(a,b,c)eT 


ijr(—ab -be- cci)(t; a - 1 + l)(f* - 1 + l)(£ c -1+1) 

(i;* - m b - m c - ' 


So by expanding the numerator and transposing the same term to the left-hand 
side, we get 

21 = — ^2 \j/(—ab — be — ca) 

(, a,b,c)er 


( (1 -?“)(!- 


+ 


1 


+ 


1 


! b ) ' (1 - t b )(i - n ' (i-! e )(i-! a ) 

1 1 


l - l - l; b l - ? 


+ 1 ) 
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If we fix a and b, then we have 


y, i \r(—ah — (a + b)c) 
C 6FJ 


f— t/r(— ab) if a + b ^ 0 , 

\(p- l)t/r(— ab) = (/?—!) if a = — b ^ 0 . 


So if we fix a ^ 0, then we have 


p-t 

y \ff(—ab — be — ca) = ip — 1) — y \fr(—ab) = p. 

b,c= 1 b^—a, b^O 


Here by Examples 8.8 and 8.10, we have 


E 

a,b€F * 


\//(—ab) 
(l - H(i - 


= pB\ 


Y' 1 = pip - 1 ) PiP - 1 )ip - 5 ) 

P ^ (1 - n(l - S~a) 


3 6f; 


12 


So noting p — 1 = p — \[r(a 2 ), we get 


y i j/{—ab—bc—ca) 

(a,b,c)€T 


(W«)(1-^) 


=-p b U+ 


p(p - 1)0? - 5) pip - 1) 


12 


+ - 


We have also 


E 

( a,h,c)€T 


xj/i—ab — bc — ca) pip — 1 ) 

1 -t, a ~ 2 ' 


y xffi—ab — bc — ca) = pip—l). 

(a,b,c)€T 


So as a total, we have 
3 


PiP ~ i)ip ~ 5) pip- 1) _ 3 2 pip- l) 2 

— ~,PB\,^r 


I = -pB 2 , - — — - — L = - p B\ . 

2 x ' f 8 2 2 8 


In particular, if p = 1 mod 4 then B\ ^ = 0, and if p = .3 mod 4 and p > 3, 
then 5i = —h(—p) ihi—p) is the class number of the imaginary quadratic field 
Qi^/—p)), so there exists a formula for I by class numbers. 

Example 8.19. We put S = {(a, b, c ) e Z 3 ; 1 < a,b,c < p — \,ab + be + ca = 
0 mod p } and 
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/ = ^ \l/(abc)abc. 

(a,b,c)€S 

The condition in the sum / is complicated and it is not easy to calculate / . But if 
we assume that \j/(— 1) = 1, then / can be calculated as follows. By the relation 

/= Y f(~abc)(p - a)(p - b)(p - c) 

(a,b,c)ES 

we get 

21= Y, if(abc)((p—a)(p — b)(p — c)+abc) 

(a,b,c)ES 

= \l/(abc)(p 3 — p 2 (a + b + c) + p(ab + be + ca )). 

(a,b,c)&S 

But we have i fr(abc) = \{f(—c 2 (a + b )) = i //(a + b) and since the condition of the 
definition of S is symmetric with respect to a, b, c, we have 

Y \j/{abc)(ab + be + ca) = 3 Y^ + b)ab 

(a,b,c)€S ( a,b,c)ES 

= 3 Y / ^( a + b)ab 

a,b,a+b ^ 0 mod p 

p ~ l / 1 \ 

= 3 Y 'l f ( a + b)ab = -3 p 2 I B\^ + - B 2 , y, ) . 

a,b= 1 ' “ ' 


Since we assumed i// ( — 1 ) = 1, we have B\f = 0 now. Moreover we have 


Y. + b)(a + b + c) 

(i a,b,c)€.S 



P~ 1 

= -3 Y f( a ) a = -3pBi,f 

a= 1 


and this is equal to 0. We have 

_ p - i p - i 

Y. f( a +b) = Yj VK a + b) = Y / (~ x l r ( a )) = 0- 

a,b,c€S a,b= 1 a= 1 
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So we have 


1 = — ^~ b 2 , f - 


Remark 8.20. When \[r (— 1) = — 1, a simple formula to write down / by general- 
ized Bernoulli numbers is known only conjecturally. 


8.7 Sporadic Example: Symmetrize Asymmetry 

Let \[r (,r) be the quadratic residue character modulo p (p is a prime) and consider 
the following special sum. 


p - 1 

I = t/r(m — An)ijf(m)mn. 


m ,n = 1 


A formula for this sum was first given in [91] and [39] by comparing the trace 
formula and algebraic geometry, but this can be calculated in an elementary way as 
well. We illustrate this calculation below. 

In the above definition, if we take n instead of m — An, then the sum is very 
simple and given by 



But the sum / we actually defined above is subtly breaking its symmetry and that 
obstructs an easy calculation. We get over this by changing it to an exponential sum 
and transforming it to a symmetric one. By the relation 


p - 1 


p - 1 


i/r(m) = g(iA) -1 tim - An) = g(f) _1 J2 


b = 1 


we have 


p i 


p i 


/ = g (f)~ 2 if(ab) mnt; am+Km - in) . 


a,b= 1 m,n = 1 


But for c pZ,, we have '^ n Jj /«£' 


•cm 


— hence we have 
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/ = g(f) 


-2 


P - 1 


E 


i //(ah) 


p - 1 


— Z? mod p 


- + tA(-D E Wl < 


■4/i6 


m,n,b= 1 


P 1 


= ^(-i) E 


i jr(ab) 


P(P~ l) 2 


a,b 
a+by^p 




using g(^r) 2 = xjj(—\)p. In order to calculate / from the above relation, it is 
sufficient to calculate 


a+Zj^/? 

Now a direct calculation of J seems difficult partly because the sum in the definition 
of J is not symmetric with respect to a and b. So we try to rewrite this in a 
symmetric way. If we exchange a and b, this is just a change of notation, so we 
have 


p - 1 


J = £ jj—. 


i j/{ab) 


Si (i-r 4a )(i-^ +a )' 

Now we will add this to the original expression and take the average of both. To 
calculate this, we use the following equality. 


1 1 
+ 


2 - £- 4 “ - £ 


4a 5 — 46 




4a+46 _ £-4# _ ^46 


i-£- 4a 1 i-£~ 4i (i - £ -4a )(i - i;~ Ab ) (l - £ 4a )(i - £ 4i ) 

£4a+4i _ j + Q _ £4a^j _ ^4i)^ 

“ (1 -£ 4a )(l -t 4i ) 

j ^4a+46 


(i-£ 4 ")(i-£ 4i ) 


After all, if we put 


r _ v 


a,Z?= 1 
a+by^p 

p - 1 


* = E 


^(ai)(l -£ 4a+4i ) 


a,b 
a+by^p 


^ (1 - £ 4a )(l - £ 4A )(1 - Z a+b ) ’ 
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then we have 


xl/(— 1) 

j = 


Here we have 


P -\ P - t 


*=--£ E nab)k?<-* 


+b) 


‘ k= 1 a.b = 1 

a+b^p 

1, ,,^2 , r iw 1Vl P(P ~ V 

— (g(V) ~ t(~l){P ~ 0) x ^ = ^ ■ 

p 2 2 


Also we have 


p - 1 

J 2 = E tipb) 


w 


4a+46 


a, 6=1 
a+b^p 

p - 1 ^3 


(1 - £ 4fl )(l - £ 4i )(l - £"+*) 


E L/=i 

n - 


E/=o A(«6K ,<a+i) 


p-t 




4l/r(— 1) 


^ (1 _ £4* )(1 _ (1 _ ?4fl)(1 _ £—4 a)' 

Now, by virtue of Example 8. 10, we have 

p-i 


1 


p-i 

^ (T^^)(i-r 4a ) 


= y f — 1 - ) 

y V 1 - t 4a (i-f 4a ) 2 / 


73-1 + (/> - !)(/> - 5) _ /r - 1 


12 


12 


Furthermore we have 

yla 


P ~ l 1 P ~ l 1 P_1 

E 1 _ M fl = — E f( a ) c ^ ac+al = — E ^ (4c + /)c - 

^ P a,c = 1 P 


a= 1 


c=l 


Hence if we put 5(/) = Er=i (4c + l)c, then we have 


j 2 = i±Pj2s(i) 


2 ^(-D^ 2 -!) 


/ =0 


Summing up, we have 


j = i - — y sd ) 2 = 

4 6 2 p y 

J / =1 


T 7 — 1 , P 2 — 1 1 vA „,,,2 (p-l)(2p-l) 1 


z=o 


12 


-f E 5 « 2 - 

2 pU 


4 
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Next the following quantity should be evaluated. 

3 /p- 1 \ 2 /P-1 

ysxo^iy ih 4 c)c] +(X^ (4c + 1 ) c 

1=0 \e=l / \c=l 


+ ^Xj^ 4c + 2)c j + ^^t/r(4c + 3)cj 


(8.9) 


There are many ways to evaluate this, but here we calculate it just directly. (As for 
more general method of calculation, see [45].) First we have 


p - 1 

y (i/r(2c)c + t/r (2c + l)c) 

C=1 


l ( p ~ x p -* 

- I y 0A(2 c) x (2c) + (2 c + l)(2c + 1)) - y if {2c + 1) 

\c=0 c = 0 



j p - i p-i 

= 9 H f( c )( c + C + p) = HC)C. 

c = 1 c = 1 


Hence we have 


p - 1 

^ l/r(2c + 1)C = 

c=l 


P-1 

(1 -l/r(2»y l/r(c)c. 

c= 1 


In the same way, we have 


p-i 1 4 p-i 

y (i/r(4c)c + if {Ac + l)c + x/f {Ac + 2 )c + if {Ac + 3)c) = - y i jf{c)c 

C=1 c= 1 

P-1 

= y^wc. 

C=1 


This implies that 


p-i 

y(i/r(4c + 1) + if {Ac + 3))c 

C= 1 


p-i 

(i - f{ 2 )) y f{c)c. 

c=l 


Now we consider the case p = 1 mod 4 and p = 3 mod 4 separately. 
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First if we assume p = 3 mod 4, the \jr(— 1) = —1. So we have i/r(4c + 3) = 
—\fr(A(p — c) — 3) = —\jr(A(p — 1 — c) + 1), and 


p — i p — i p - 1 

y, l/r(4c + 3)c = — y t/r (4c + l)(/7 — 1 — c) = y l/f (4c + l)c. 

c=0 c=0 c=0 


Hence in this case we have 


y y 1 - f{2) y 

y t/r (4c + i)c = y i/r(4 c + 3 )c = — - — y i a(c)c. 

C=1 C=1 C=1 

So we have 

E sm 2 = (i + a - tm f + (1 ~ y 2 ) (£ iKOcj 

= (4-3t/r(2))/B^. 

By the assumption p = 3 mod 4, if p / 3 then we have 2?i ^ = —h(—p), where 
h(—p) is the class number of the imaginary quadratic field Q(^/—p ) (p. 90, (6.5)). 

Next we assume that p = 1 mod 4. In this case we have = 0, so we cannot 
separate t/r (4c + l)c and \f/(Ac + 3)c in the above expression in the 
same way. In order to separate these two, we introduce a Dirichlet character 8 with 
conductor 4. Here for x = 3 mod 4 we have 8(x) = — 1 and we see that \fr8 is a 
primitive Dirichlet character with conductor Ap. By definition, we have 

ip-\ p - 1 

4 pB x E f{c)8{c)c = y 0A(4c + l)(4c + 1) - f(Ac + 3)(4 c + 3)) 

c=0 c=0 

= 4(S(1) - S( 3)) = 85(1) = -85(3). 

By virtue of the fact B\ ^ = 0, we have 5(0) = 5(2) = 0. So we get 

5(l) 2 + 5(3) 2 = ^(5,.^) 2 . 

Also we have h(—p) = Summing up all the above results, we get 


/ = 


p{p-l){p-2) 

12 


7/2 if p = 3 , 

p 1 x h(—p) 2 1 A if p = 1 mod 4, 

(4 — 3i/ f (2 ))p 2 x h{—p) 2 /2 if p = 3 mod 4 and p ^ 3. 


By the way, as for further generalization of the results in this section, see [45]. 
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8.8 Quadratic Polynomials and Character Sums 


The example given below is not related to Bernoulli numbers so much, but since the 
method is similar, we take this opportunity to explain it. Here too, we assume that 
p is an odd prime and \[r{x) = (jj is the quadratic residue character modulo p. 

Let P(x) be a polynomial in a variable x. If we consider the sum i//( P(x)), 

then this is related with the number of pairs (x, y) e F 2 of integral solutions of 
y 2 = P(x), so related with the congruence zeta function of algebraic curves and 
a complicated quantity. There is no simple formula for general P(x). But in the 
special case where P(x) is a quadratic polynomial, it corresponds to a conic. In this 
case the structure of the curve itself is simple and there is a simple formula for the 
sum as above. Let us practice the calculation. We write a quadratic polynomial of 
x as P(x) = ax 2 + bx + c ( a, b, c e Z, a ^ 0 mod p). It is not so difficult to 
calculate mod; , \f(P(x)). Since we take p to be an odd prime, we consider the 
finite field F p and rewrite P(x) = a ( (x — d) 2 + e). Then we have 


Y f(P(x)) = ty{a) Yj f(x 2 + e). 

x mod p x mod p 

So this right-hand side is the point. 

Lemma 8.21. 


Y vk x 2 + e ) 

x mod p 


j — 1 if e ^ 0 mod p, 
[ p — 1 if e = 0 mod p. 


Proof. If we write £ = e 2n '^ p , then we have 

p - 1 


n * 2 + e) = 

g( t) ^ 


k = 0 

where g(\j/ ) is the Gaussian sum. So we have 

E + e) = E ^ E 

x=0 syyj k=0 x=0 


x 2 +e) 


But we have ^ r ' = Q ^ kx ~ = x/r(k)g(\l/), so we get the following result. 


E*(*> 2 £* e 

k = 0 


Et*' 


k = 1 


| — 1 if e ^ 0 mod p, 
| p — 1 if e = 0 mod p. 


□ 
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By the way, the result above has of course a relation to the number of pairs 
( x,y ) e F 2 p such that x 2 + e = y 2 . Namely, for each x e F p , there are l + i j/(x 2 + e) 
number of y which satisfy this equation. So p — 1 pairs of (x, y) are solutions of 
this equation if e ^ 0 mod p. 

In general, also for a quadratic form Q(x) of n variables, that is, a homogeneous 
polynomial of variables Xj of x = (xi, X 2 , . . . , x n ) of degree two, we sometimes 
consider the sum mod p and the results are well known, but this is remote 
from Bernoulli numbers and we omit it here. 

Next we treat the sum fix 2 + e ) x - (Note that the sum \jr{P{x))x 
for a general quadratic polynomial P(x) does not reduce to this sum.) Here we have 


p — i p — i p - i 

T. fix 2 + e)x = y (/> - x)f((p - x) 2 + e) = ^(p - x)\^(x 2 + e ), 

x=\ x = 1 x=l 

so we have 

p - i p-i /p-i \ 

2 fix 2 + e)x = p Y f(* 2 + e ) = P ( + e ) _ f( e ) ) ■ 

X = 1 A = 1 \.V=0 / 

Hence we have 


p - 1 

y, i ff(x 2 + e)x = 

A = 0 


(i ±tMlP 
2 

P(P- 1) 


if e ^ 0 mod p , 
if e = 0 mod p. 


As an application, when e f pZ, we can show the following formula. 


p - 1 


y fix 2 + ey 2 )xy = - 

x,y = 0 


(1 + \l/(e))p 2 (p — 1) 


We omit the proof here. 


8.9 A Sum with Quadratic Conditions 

For a polynomial f(x,y,z), we put 

A/(/) = {{a, b , c) e Z 3 ; 1 < a, b, c < p — 1; /(a, b, c) = 0 mod p}. 

Proposition 8.22. For an integer e ^ 0 mod p, we write h e (x, y , ") = exz — y 2 . 
Then we have 
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E 


t(a) 


p - 1 


(< a,b,c)€.M(h e ) 


(1 - £«)(! - £*)( 1 - t c ) 


(1 + t Ke))gi}!r)Bi,f. 


Proof. Denote the left-hand side by I . In the definition of I , we may replace b by 
—b. Noting that 

u-rV + a-^T 1 = i, 

and taking integers a and e such that act = ee = 1 mod p, we have 

f(a) 


p - 1 

2'= E 


, (i - na - p b 2 ) 


p - 1 


p - 1 


a, 6=1 * \ ^ m = l / 

= -- E 7E^T"t(Vr(aem)g(t/r) - 1) 

P a,m = \ ^ ^ 

= 1 + 


□ 


As an application of this formula, we can show the following. 

Corollary 8.23. 


» 3 (n — 1) 

E t(n)lmn = (1 + \fr(e))B h 

(/ ,m,n)€M(h e ) 

Proof. For a while, for any rational number a whose denominator is not divisible 
by p, we understand that we define £“ by regarding a as an element of F p . Then 
we have 


p - 1 

- P 3 i = e lmn E V f («)r ,+Am+CM 

l,m,n = 1 ( a,b,c)€M(h e ) 

l,m,n = 1 tz = l 6=1 



8.9 A Sum with Quadratic Conditions 


135 


P~ 1 

= J2 lmn^ al xl/(a)t;- aem2/4n (^(n/(ae))g( f)-^ em2/An ) 

l,m,n,a= 1 
p - 1 

= J2 lmn^ a, (f(ne)r aem2/in g(^)-^(a)) 

l ,m,n,a=\ 

p -\ 

= (-g(f)f(ne)-f{l)g{f))lmn+pg{f) £ l/r («)//« n. 

l,m,n = \ 4ln=em 2 mod p 


Hence we have 


gW 

,p 3 


(1 + xlf(e))pB hf 


P 2 (P ~ l) 2 


■p \j/{n)lmn 

(/ ,ra ,«)€ M (/14/e) 


In the above calculation, we used = V'’( c )g(V'’)- Here in the final 

expression of I and in the first term of the right-hand side above, the number e 
appears only as i ff(e). Since we have i/r(4/e) = i j/(e), we can replace the condition 
M(h\/ e ) of the sum on the right-hand side by M(h e ). By these considerations, we 
get the formula for £ (/ , m ,„ )eM(Ac) VK n)lmn . □ 

Several such sums with quadratic conditions have something to do with special 
values of zeta functions of prehomogeneous vector spaces. But there are a lot of 
mysteries around this in general. It has not been clarified when such formulas for 
exponential sums can exist or what essential reasons there are for that. For some 
special quadratic conditions, we have several conjectures but proofs are not known. 
It seems premature to treat this problem here, so we omit the concrete description of 
conjectures or other things, but it would be a very interesting problem to study these 
sums from the viewpoint of relations between exponential sums and special values 
of zeta functions of prehomogeneous vector spaces. 

Exercise 8.24. Give an example of non-trivial primitive Dirichlet character / such 
that x 3 is trivial and for that /, calculate the Gaussian sum g(/) explicitly. 

Exercise 8.25. (1) Assume that the cardinality of a finite group G is odd. Then 
show that any character / such that y 2 = I (the identity character) is a trivial 
character. 

(2) Assume that G is a finite cyclic group of even order. Then show that the non- 
trivial character / of G such that / 2 = 1 exists uniquely. 

Exercise 8.26. (1) Let p be an odd prime. For any natural number r, it is known 
that (Z/^' Z) X = Z/// -1 Z x Z /(p — 1)Z. By using this, show that if a non- 
trivial Dirichlet character / modulo p' satisfies /(a) 2 = 1 for any a with 
(a. p) = 1, then the conductor of / is p. 
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(2) Let r be a natural number with r > 2. It is known that (Z/2'Z) X = Z/2Z x 
Z/2 r_2 Z (or ^ ((1 + 2Z)/(1 + 4Z)) x ((1 + 4Z)/(1 + 2 r ~ 2 Z)), the latter is 
cyclic generated by 5.) Show that if a non-trivial Dirichlet character y modulo 
2 r satisfies '/(a) 2 = 1 for any a with (a, 2) = 1, then the conductor of y is 
either 4 or 8. 

(3) Show that if / is any non-trivial Dirichlet character / such that y 2 is trivial, then 
the conductor of y is D k for a fundamental discriminant of some quadratic 


field K. 


(4) Show that if / is any non-trivial primitive Dirichlet character y such that y 2 
is trivial, then / = Xk for some quadratic field K , where xk is defined as in 
Sect. 6.3. 

Exercise 8.27. Let / be a primitive Dirichlet character with conductor /. Assume 
that / = Wi /«2 with (mi, m 2 ) = 1. We regard / as a character of (Z//Z) x = 
(Z/ in 1 Z) x x (Z/ni 2 Z) x and denote by y, the restriction to Z//n,Z for i = 1, 2. 
We also denote by /, the Dirichlet character modulo m, associated with y, . 

(1) Prove that the conductor of y, is m , . 

(2) For any n, put = exp (^p-). We take integers x, y with xm\ + ym 2 = 1. 
Show that for any a € Z, we have 



-ay i-ax 
•m 1 ' 


(3) Show the following relation between Gaussian sums. 


g(X ) = X\(m2)X2(mi)g(x\)g(xi). 


(4) For each primitive Dirichlet character y with conductor 4 or 8 defined in 
Sect. 4.1, calculate the Gaussian sum g(x)- 

(5) Let xk be the Dirichlet character modulo D k defined in Sect. 6.3. Show that 
g(XK ) = '/D~k by using g(p) = J (— 1)(p _ 1) / 2 /> for odd primes p and the 
reciprocity law of the quadratic residue symbol. 

Exercise 8.28. Prove Example 8.14 using Proposition 8.13. 

Exercise 8.29. Show that the following relations hold. 

(1) For any primitive character x modulo /, we have 
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( 2 ) If we denote by t j/(n) = (-) the quadratic residue character, we have 

ST ,r , w/ , 1 ( 2 D 2 />(/>“ l)(7/7 + 1)^ 

2 ^ ^(m + n)f(m)mn = - I p B l f I . 

m,n = 0 ^ 

( 3 ) We have 

p - 1 

y, l/r(/m + 777 77 + 77/) = \/f(—l)p(p — 1). 

l,m,n = 0 

y ljf (/ 777 + 77777 + 71 / ) / = ^r(— 1) — 

l,m,n = 0 

(/ , , , , w2 , , ^p 2 (p~ Wp- i) 

/ -\- mn -\- nl)l = y(— 1 ) . 

l,m,n = 0 

p_l ry’fv 1)2 

y + 777 77 + 7j/)/ 3 = t/f(—l)- — . 

l,m,n= 0 

y \Jf (/ 777 + 777 77 + 77 / ) / 777 = l/f(— 1)^— ^ 

l,m,n = 0 

/,_l „3(„2_n 

y + mn + nl)l 2 m = i/r ( — 1 ) — . 

l,m,n= 0 

y t/t (/ 777 + 777 77 + 77/)/777 77 = ^—(Bl_f) 2 + 1) — ^ 

l,m,n= 0 

(Hint: J 2 n=o + m) + /w) is 0 if / + 777 ^ 0 mod p. In the last relation, 
substitute / , 777, 7? by p — l,p — m, p — n and add that to the original expression.) 



Chapter 9 

Special Values and Complex Integral 
Representation of L -Functions 


As a continuation of Chaps. 4 and 5, we study here properties of Hurwitz 
zeta functions and Dirichlet L-functions such as their analytic continuation and 
functional equation, and calculate their special values at negative integers. There 
are various proofs for the functional equation; here we explain the method using a 
contour integral. Although there would be a viewpoint that it would be too much to 
introduce a contour integral, it is interesting for its own sake and useful too, so we 
venture to derive the functional equation from a contour integral by a method to cut 
out the path of the integral. 


9.1 The Hurwitz Zeta Function 

For a positive real number a, we define a zeta function £(s, a) by 

OO 

£( s,a ) = ^ ( n + a)~ s . 

n = 0 

Since the series on the right-hand side converges absolutely for Re(.v) > 1, £ (.s', a) 
is defined in this range. This function %(s,a) is called a Hurwitz zeta function. 
(Hurwitz [43], Whittaker 1 and Watson" [104, Part II]). In particular, when a = 1, 
£(s, 1) is nothing but the Riemann zeta function ( (.y). In this section, we shall give 
an integral representation of £(s, a). 

We denote by r(s) the gamma function. This is defined by 


'Edmund Taylor Whittaker (born on October 24, 1873 in Southport, England — died on March 24, 
1956 in Edinburgh, Scotland). 

2 George Neville Watson (bom on January 31, 1886 in Devon, England — died on February 2, 1965 
in Warwickshire, England). 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_9, © Springer Japan 2014 
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e ‘t s 1 dt 


(9.1) 


for complex numbers 5 such that Refs) > 0. The integral on the right-hand side 
converges absolutely and uniformly on any compact sets in Refs) > 0, and so we 
see that C(s) is a holomorphic function for Ref s ) > 0. By integration by parts, we 
get the most important property 


r(s + 1) = sCfs) (9.2) 

of the gamma function. Through this equality (9.2), Cfs) can be continued 
analytically to a meromorphic function on the whole .s-plane. More concretely, if 
we rewrite (9.2) as 


r(s) = 


r(s + i) 

s 


then since the right-hand side is a meromorphic function for Refs) > —1, V fs ) is 
continued analytically to Refs) > — 1 and s = 0 is a pole of order 1 with residue 1. 
We also have 


r(s) = 


r(s + 2 ) 

sfs + 1) ’ 


and repeating this, we get 


r(s) = 


r(s + M) 

sfs + 1) - - - fs + M — 1) 


(9.3) 


for arbitrary natural number M . Through (9.3), P(s) is continued analytically to 
a meromorphic function on Refs) > — M . Since M is arbitrary, f'(.s) is continued 
analytically to a meromorphic function on the whole s-plane and s = 0, — 1 , — 2 , ... 
are poles of order 1, and the residues can be calculated by (9.3). 

Changing the variable as t — > (n + a) t ( n + a > 0) in (9.1), we get 


1 

(/? + a) s 

Formal calculation leads us to 


OO 


E 


i 

(n + a) s 


I r°° 

— / e -(»+«v- 1 dt. 

r(s) Jo 


i r 00 °° 

— / E e ~(n+a)l t s-\ d 


9.2 Contour Integral 
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and for the range Re(.v) > 1, we see that the right-hand side converges absolutely, so 
the above calculation is justified. Hence, for the range Re(.v) > 1, we get an integral 
representation of £ (s, a ) as 


1 C°° p~ at I C c 

Us,a) = — -f~ l dt = — 

r(s) Jo l-e ' r(s ) Jo 


t e ( l Q ) r 
1 

e r - 1 


s-2 


dt. 


(9.4) 


This integral representation suggests us that %(s,a) has something to do with 
Bernoulli polynomial Bk(a)- 


9.2 Contour Integral 

We transform the integral representation (9.4) to a complex integral in order to 
apply to it the residue theorem of complex analysis. We regard the interval [0, oo) 
of integration as a path of a complex integral, and then expanding this a little, we 
consider the following contour I{e, oo) (e > 0). We define I(s, oo) by a curve 
<p : (— oo, oo) — » C given by 


I(e, oo) : 


(p(u) = 


— u u < — e, 

e exp (?r i '^) —s<u<s, 

u u > s. 


In the definition of / (e. oo), the parts for u < —e and u > s overlap, but we interpret 
it that for u < — e we take the path above the real axis and for u > s below the real 
axis. This path is illustrated in Fig. 9.1. 

Now consider the complex curvilinear integral 

f t£(1 ^ t s ~ 2 dt. 

J /(e.oo) ^ 1 

Since we have to treat t s ~ 2 on /(e, oo), we shall choose a complex power t s for 
jeC. Denoting the argument of t by arg t, we can define a single- valued function 
log t on C — {z = x + iy \ y = 0, x > 0} by 

logf = log |?| + i argf (0 < argf < 2n). 



Fig. 9.1 Path of I(e, oo) 
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Using this, a single-valued analytic function t s on C— {z = x + iy \ y = 0, x > 0} 
is defined by 


t s = e slogt 


We divide the contour I(e, oo) into three pieces as follows. 

Ci : the part of the real axis from oo to e, 

1(e): the circle of radius e with center at the origin (oriented counter-clockwise), 
C 2 : the part of the real axis from e to 00 . 

We have I(e, 00 ) = Ci + 1(e) + C 2 . For t on Ci, we put arg t = 0, and on C 2 we 
put arg t = 2n. Then the integral on Ci is given by 


r te [l ~ a)t s _ 2 _ r e te (] ~ a} ‘ 

JCi e' - 1 r r Jo , o c - 1 

= ~Js e'-l 

and on C 2 by 

r poo f (1 -a)t 

/ t S ~ 2 dt= e (s-2)(\ogl+2ni) dt 

Jc 2 e ‘ ~ 1 Js e'-l 

_ e 2ni(s-2) J 

Noting e 2 jr 'F- 2 ) _ e 2n is ^ together we get 


t s 2 dt 

00 f (1 -a)t 


t s 2 dt 


00 


e< - 1 


t s ~ 2 dt. 


L 


/(e,oo) 


t a )* 

t s ~ 2 dt = (e 2ms 

e' — l 



tg(l a)t 

e' — l 


dt+ 



e (l— a)t 
e'-l 


dt. 


(9.5) 


The circle 1(e) is parametrized as t = ee ,e (0 < 6 < 2n), so on 1(e) we 
have t s ~ l = gs-'g'l 1-1 ) 0 ) and the absolute value of the integral is estimated from 
above as 



e (l -a)t 


e'—l 



e (\-a)ee ie 

— e s - l e i(s - X)6 iee w 

e ee - 1 


dO 


— gRe(s) 



g (l -a)ee ie 

e se>9 - 1 



dQ. 


(9.6) 
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£ 

We see — is bounded as a function of s and 0, so if we take the limit s — > 0 

e se - 1 

then, for Re(s) > 1, by (9.6) we get 

r e tt-a)t 

lim / f s_1 dt = 0. (9.7) 

«-» Jm e '~ 1 

On the other hand, in (9.5), by Cauchy’s integral formula, the curvilinear integral 
on the left-hand side does not depend on the choice of sufficiently small positive 
number s (s < 2 it), for if we take another s' > 0, then in the region surrounded by 
two paths, the integrand is a single-valued holomorphic function. Now we assume 
that Re(,v) > 1. In (9.5), since the left-hand side does not depend on e > 0, we fix 
s on the left-hand side, and on the right-hand side we take a limit as s — > 0. Then 
using (9.7), we get 


L 


/(e,oo) 


t s ~ 2 dt = (e 2nis 

e‘ - 1 v 



te 


(1 ~a)t 


s—2 


dt. 


(9.8) 


Thus we obtain the following curvilinear integral representation of t,(s,a). 

Proposition 9.1. Let e be a positive number with s < 2n. If Re(.v) > 1, then the 
Hurwitz zeta function if (s , a) ( a > 0) has the contour integral representation 




r(s)(e 


2nis . 


1 


te 


1 ) J I(e,oo) ^ 1 


s—2 


dt. 


Through this formula, £(s,a) is continued analytically to a meromorphic function 
on the whole s-plane and has a unique pole at s = 1 of order 1 with residue 1. 


Proof. The first half is immediately obtained by the integral representation (9.4) and 
(9.8) of f(s, a). We shall see the latter half below in turn. We prepare the following 
lemma. 

f te (x ~ a)t 

Lemma 9.2. The integral / f' 2 dt converges absolutely for arbitrary 

Ji(e.oo) e‘ ~ 1 

s G C and gives a holomorphic function on the whole s-plane. 


f te^ _ 9 

Proof. The curvilinear integral / t s “ dt ( j = 1 , 2) converges absolutely 

Jcj e' — l 

for arbitrary s & C, so it is holomorphic there. Also the curvilinear integral 

f te (l ~ a)t 

I t s dt converges absolutely for arbitrary s by (9.6) since s > 0 

J 1(e) 1 

is fixed, and is differentiable with respect to s, so this is also holomorphic on the 
whole s -plane. □ 
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As is well known in complex analysis, 


1 


r(s) 


is a holomorphic function on the 


whole 5-plane, so by Proposition 9.1 and Lemma 9.2, poles of (s. a) result from 
zeros of the function e lllls — 1. Since £( s,a ) converges absolutely for Re(.v) > 1 
and is holomorphic there, possible poles are s = 1 or .v = 1 — m (m e N). Among 
these, at s = 1 — m the function 


1 

r(s)(e l7tis - 1 ) 

is holomorphic. Hence £( 5 , a) has a unique pole 5 = 1 and its order is 1. The residue 
is given by 


lim 


(J - 1) 


/ 

Jill 


te^~ a)t 


. 1-2 


dt 


'-1 r(s)(e 2 *‘° - 1) y /(gi00) e' - 1 

— / 1 1 dt = Res I I 

Tti J 1(e) e ’ — 1 t=0 - 1 / 


= -[ 

2ni J n 


fg(l a)t 


/(e,oo) i - t 1 


dt 


2ni 


1. 


The above calculation of the last integral is due to the residue theorem. The only 
e d ~a)t 

pole of the integrand inside 7(e) is t =0 and it is of order 1, so it is enough 

e‘ — 1 

to calculate its residue. 

We have finished the proof of the latter half of Proposition 9.1. □ 

Next, we calculate the special values of £(s, a) at non-positive integers 5=1 —in 
(m e N) by using the contour integral representation. 

Proposition 9.3. Let m be a natural number. Then for a > 0, we have 

Bm ( tl ) 


£(1 —m,a) 


m 


Here B m (a) is the Bernoulli polynomial defined in Sect. 4.3 (p. 55). 
Proof. By Proposition 9. 1 we have 


£(1 — m.a) 


( lim 


r(s)(e 


1 

2 nis . 




1 ) 7 J I(e, 00 ) ^ 1 


dt. 


f e d a)t _ 

In the last curvilinear integral, the integrand 1 m becomes a single-valued 

e‘ — 1 

holomorphic function on C \ {0}. Notation being as before, in the integral for the 
contour 7(e, 00 ) = Ci + 1(e) + C.f, the curvilinear integrals on C\ and Ci cancel 
each other. That is. 


9.3 The Functional Equation of £ (,s, a) 
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r fptl-alf r 

/ r m ~ l dt + / 

J Cl e' - 1 Jc 2 


te ( 

1 

e l — 1 


-m— 1 


dt = 0 . 


The calculation of the curvilinear integral on 7(e) is done by calculation of the 
residues. By virtue of (5) and (4) of Proposition 4.9 on Bernoulli polynomials, the 
residue of the integrand at t = 0 is given by 


Res 

1=0 


te (1 ~ a)t 
e‘ - 1 



Bm ( 1 o.) ^ | B,„ (a ) 

m ! m ! 


On the other hand, the limit is calculated by (9.3) as follows. 


lim 1 — lim •*(•* + 1) ■ ■ ■ (s + m - 1) 

s-*l—m r(s)(e 27lis — 1) r(s + Itl) ( s + m — 1) — l) 

= (—!)(— 2) ••• (1 -m) 

2 ni 

_ (— l)'" -1 (m - 1)! 

2ni 


So we have 


7(1 — m, a) = (— l) m 1 (m — l)!Res 

r=0 


ftpd-a ) 1 \ 


By this, Proposition 9.3 is proved. 


□ 


9.3 The Functional Equation of £(s, a) 

We fix to < 0, and denote by C(to ) the line on the f-plane defined by Re(f) = to 
oriented upwards: 


C(? 0 ) : t = to + iy (-00 < y < oo). 

We assume Re(5) < 0. In the contour integral representation of the Hurwitz zeta 
function ( (s, a) (Proposition 9.1), we cut out the contour 7(e, oo) as in Fig. 9.2, and 
shift it over the imaginary axis to the line — C(t 0 ). Here we are taking 0 < a < 1. 
When it goes beyond the imaginary axis, it passes the pole t = 2nin(n e Z, 

n / 0) of the integrand 1 , so the residues appear there. The assumptions 

e’ — 1 

Re(s) < 0 and 0 < a < 1 are conditions for the integral to converge. Under the 
condition 0 < a < 1, the curvilinear integral along C(t {) ) converges absolutely. 
Since we can transform as 
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Z(s,a) 


1 


0 (\—o)2ti in n 

-2 ni)Y — - / 

(27 tin) 1 s J C (t 0 ) 


r(s)(e 2nis — 1 ) 


n & Z 


f e (l 4)t 

- 1 


I"- 2 dt 


1 


r(^)(e 2jr,s - 1) 




_j_ e 2 .nina e - r 


n= 1 


{Inn) 


\-s 


-/ 

JCUo) 


e‘ - 1 




taking the limit to — > — oo, we have 
(— jri)(2;r) J “ 1 




e 2 


00 glnina 


/? — 1 


/"(s) sin7T,y 
Here substituting 1 — {a} for a, we get 
(— jri)(2jr) i_1 ^ 


y — 

Z_^ „i- 


e 2 


00 g— Inina 


E 

«=i 


,1-5 


(9.9) 


£(s> 1 - {a}) = 


/"(s) sin 70 


<? 2 


00 e -2nina 


y — 

Z^ „1- 


- e 2 


00 glmna 


EW <») 


n=l 


« = 1 


9.3 The Functional Equation of f (,s, a) 


147 


where the notation {a} denotes the fractional part of a: 0 < {a} < l, a -{a} e Z. 
This transformation is valid for Re(.s) < 0. By these relations (9.9), (9.10), we get 


e™ £{s,a) + e ** £(s, 1 — {a}) 


(2 7t) S 




glnina 

n 1 _f 


(9.11) 


and since t,(s,a) is analytically continued to a meromorphic function on the whole 

°° glnina 

.v-plane, } — is also continued analytically to a meromorphic function. 

' n l ~ s 

n= 1 

Summing up, we get the following theorem. 

Theorem 9.4. We assume that 0 < a < 1. The Hurwitz zeta function £ (,s\ a) is 
continued analytically to a meromorphic function on the whole s-plane. It has a 
unique pole at s = l. It is of order 1 and the residue is 1. Moreover, the following 
functional equation is satisfied. 


Z(s,a) 


(—tti)(27ty 1 
T(s) sin its 



glnina 

n \-S 


oo 



n = 1 


r,— Inina 


,1-5 


If we put s — 1 — m here and use (4.3) on p. 59, we obtain Proposition 9.3. 
Furthermore if we put a = 1 in (9.1 1), noting that £ (s, 1) = £ (s), we get 




(2 it)’ 


2 T(s) cos £ = £ 


x £(1 ~s) 


(2 tt) s 


2 T(s) sin 


x?(l -s). 


Then if we use the duplication formula of the gamma function 

r(s) = n- 1 ' 2 2 s - 1 rQr(^± 


and the reflection formula 


r(s)r(i-s) = - 


sin its 

(for these formulas, see [104, Chapter 1 1]), we get 

( 2 tt y 


r(^)t;(s) = n 1 ' 2 2 1 - 


= it s ->/ 2 


2 T (^)sin^±I> 

r(i^) 
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Adjusting a little, this gives the usual functional equation of the Riemann zeta 
function. 

Theorem 9.5 (Functional equation of Riemann zeta function). The Riemann 
zeta function £(s) can be continued analytically to a me romorphic function on the 
whole s-plane. If we put 


m = tt~ s/1 r (0 £( S ), 


it satisfies the functional equation 


£(y) = £(!-*). 


9.4 Special Values of L -Functions and the Functional 
Equations 

In this section, our aim is to define a Dirichlet /.-function and to look for its 
functional equation and special values. 

Let / be a Dirichlet character modulo /. The Dirichlet L-function associated 
with the character '/ is defined by 



The series on the right-hand side converges absolutely for Re(.v) > 1 and it is 
holomorphic there. 

First, we consider the special values at positive integers. 

The Gaussian sum g( /) associated with / is defined by 


/ 


g(x) = X(a)e 2nia/f 


(Definition 8. 1 , p. 108). If we assume that / is a primitive character modulo /, then 
by virtue of Lemma 8.2, we have 


/ 


X(n)g(x) = X! X(a)e 2nian/f 


(9.12) 


a= 1 


for arbitrary integer n . 


9.4 Special Values of L-Functions and the Functional Equations 


149 


Theorem 9.6. Let x be a primitive character modulo f and let k be a natural 
number. If x(~ 1) = ( — l) fc , then the special value of the Dirichlet L-function 
L(s, /) at s = k is given by 


Lik.x) 


(-1 ) k ~ x {2ni) k 

2kiT k 


g(x)Bk.j- 


Example 9. 7. We give examples first. Let / be the primitive character modulo 4 and 
let k be an odd positive integer. Then we have ~X = X and g(x) = 2 i, so we have 


(-l)NV 

*■(*'*> = B k , x . 

We write two of them explicitly below. By the expression in (4.1) in Sect. 4.2, we 
have B\ x = =»~ and Bj x = |, so we get 


1 1 1 

L(l,x) = 1 --+ +■■■ = 

3 5 7 

1 1 1 

Z.(3, + _ + " 


jr 

4’ 


Tt 

32' 


Next, let x be the primitive character modulo 3. Since we have B\ x 
| ,g(.x) = V3* , we get 


-l*.; 


11111 Tt 

L(1 ’^)- 1 - 2 + 4 - 5 + 7“8 + '-- 


2 2 jt 3 


11111 

L(3, Z)_1_ 2 3 + 4 3_ 5 3 + 7 3_ 8 3 + '" ~~ 3 4 V3 ' 
Proof. We give a proof of the above theorem. Using (9.12), we have 


/ 


uk.x)g(x) = 


00 glnian/f 


a = 1 n = 1 

Noting the assumption /(— 1) = (— I ) k , the right-hand side is transformed into 

f e 2nian/f 


\ J2 

a— 1 ne I 

n t^O 
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By virtue of Theorem 4. 1 1 (p. 59) and (4. 1 ) (p. 54), this value can be rewritten as 


^ ) E X(a)B k {a/f) 

a= 1 


—(2 ni) k 
2k\f k ~ x 


Bk, x - 


Noting g(x) 1 = /(— 1 )g(x)/ f by Lemmas 8.2 and 8.3 in the last section, we can 
derive the theorem from this. □ 

We express L(s, y) by using Hurwitz zeta functions. Hurwitz zeta functions 
are very useful to deduce various properties of /.-functions. Under the assumption 
Refs) > 1, we can transform L(s, y) as 


/ OO 


L(s,x) = EE 


a= 1 m = 0 
/ 


X(a + fm) 
(a + fm) s 


= E * {a) E 


1 


(. a + fm) s 


r-'E.m'E. 


L -‘(m + a/f) s 

a = 1 m= 0 


and the next proposition follows. 

Proposition 9.8. Let x be a primitive Dirichlet character modulo f > 1. Then in 
the range Refs) > I , we have 


l(s, X ) = r s 




Through this expression, L(s , x) > s continued analytically to a holomorphic function 
on the whole s-plane. 

Proof. We have already shown the first half of the assertion. We use Proposition 9. 1 
for the latter half. By Proposition 9.1, the Hurwitz zeta function f(s,a/f) is 
continued analytically to a meromorphic function on the whole s-plane, its pole 
being only situated at s = 1 and it is of order 1. Hence L(s,x ) is analytically 
continued to a meromorphic function on the whole s-plane and it has at most one 
pole at s = 1 of order 1. Calculating the residue at s = 1, we get 

/ 

r l E xw = 0 

a= 1 

since / > 1 and x is non-trivial, so we see it is holomorphic at s = 1 too. □ 
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Let us derive the functional equation of L(s , /) from the functional equation of 
Hurwitz zeta functions (Theorem 9.4). We use the following notation 8 = )'(/), 
defined by 


0 if/(-l)=l, 

1 if *(-!) = -1- 


So /(— 1) = ( — l) 5 , <5 e {0, 1}. 

Theorem 9.9 (Functional equation). Let x be a primitive character modulo f . 
Then L(s , /) satisfies the following functional equation. 


where we put 


W(x) = 


g(x) 
77 i s ' 


Proof. By using Proposition 9.8 and Theorem 9.4, we continue L(s, x) analytically 
to the region Re(s) < 0, and get 


L(s,x) = xwUj) 

a = 1 ' * ' 


(—it i)(2ji) s 1 
f s T(s) SHUTS 


e * 


OO f , , 2nin 4 


,1-s 


EE 


/ \ — 2 Jr in 

X(a)e 7 

n 1 ~ s 


= 1 a= 1 


Here, using (9.12), we get 

j, , (-7T/)(27T) i_1 g(/) 

W>X) = — JTFi x • 

f s l(s) SHUTS 

Simplifying the expression, we get 


(e*? - x(-l)e^) L(l - s,x)- 


rr , ( n(s - 5) ^ g{x)(2it\ s 

r{s) cos — - — jL(s,*) = — L ( 1_J '7)- 


(9.13) 


By using the duplication formula of F(s) and the relation E(^)F(1 — s ) 
we can rewrite F(s) as 


71 

sin tts ’ 
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r(s) = 7t~ l/2 2 s ~ l r | 


E) 

= ji- 1/2 2 s ~ l r | 


\ 

l 2 , 

' r ( l ~ s 2 +s ) cos Ms f S) 


and then substituting this into (9.13) and simplifying the result, we get the desired 
functional equation. □ 

Finally, let us look at special values of L(s , x) at negative integers. 

Theorem 9.10. Let x be a primitive character modulo f . For any positive integer 
in, we have the following formula: 


L( \ — m, x) = — 


B, 


m,X 


Proof. By Propositions 9.8 and 9.3, we have 




~ l E y) 

0=1 ' ' ' 


-r~ x E 


B m (a/f) 


By (4.1) on p. 54, this is equal to — . □ 

m 

Remark 9.11. We note that Theorem 9.6 can be derived also from Theorem 9.10 
and the functional equation (Theorem 9.9). 

Exercise 9.12. Let a be any complex number. Put f(s) = — a). Find a 

functional equation of / (s), that is, give a relation between / (.y) and f(b — s) for a 
suitably chosen number b. 

Exercise 9.13. Let a be a positive integer. Determine the location of poles of the 
function £ (,sj£ (,y — a) and their residues for each case when a is even or odd. 

Exercise 9.14. (1) Let x be the primitive character with conductor 4. Show that 
L(l, x) = § by using the integral 

L 


1 + X 2 


■ dx 


and Abel’s theorem of power series convergence. 

(2) Let x be the primitive character with conductor 3. Show that L(l, /) = 
using the integral 


3vh 


by 
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f - 2 dx. 

J 0 1 + X + X 2 

Exercise 9.15. (1) Let / be any Dirichlet character modulo /. Show that for any 
x e C with Re(s) > 1, the infinite product 

n (! - x(p)p~t i 

p: prime, p\ f 

converges and equal to L(s, '/). 

(2) Show that L(s, /) 7 ^ 0 for any seC with Re(.s) > 1. 



Chapter 10 

Class Number Formula and an Easy 

Zeta Function of the Space of Quadratic Forms 


In this chapter, as an application of quadratic forms and quadratic fields, we 
give an explicit formula of some simple zeta functions, related to some so-called 
prehomogeneous vector spaces. We also prove a class number formula of imaginary 
quadratic fields. Before that, we review the theory of multiplicative structure of 
ideals of quadratic field without proof. 


10.1 Ideal Class Groups of Quadratic Fields 

We first review the prime ideal decomposition of the maximal order of quadratic 
fields. Since there are many good books on prime ideal decompositions of the 
Dedekind domain, we do not give proofs here. For details, we refer to Lang [67], 
Weber 1 [101]. Then we supply the theory of ideals of a quadratic order which is not 
maximal. Since such orders are not Dedekind domains, the theory of ideals is not 
covered by the above general theory, so we explain the difference. 

We fix a quadratic field K of discriminant D k and denote by O max the maximal 
order of K . For any non-zero ideals a and b of D max , we write 


cijbj ; a,- G a. b, G b, v is an arbitrary natural number | 

and call this a product of a and b. It is clear that this is also a non-zero ideal of 
0, ml . We give the following theorem without proof. We denote by xk the Dirichlet 
character associated with K defined in Sect. 6.3. 


'Heinrich Martin Weber (born on May 5, 1842 in Heidelberg, Germany — died on May 17, 1913 in 
Strasbourg, Germany (now France)). 
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Theorem 10.1. (1 ) Any (non-zero) ideal of the maximal order O max of a quadratic 
field K is a product of prime ideals. This factorization is unique. For any ideals 
a and b ofD max , we have N(ab) = N(a)N(b). 

(2) For any prime p e Z, the ideal pD max is decomposed as follows. 

(i) If Xk(p) = 1, then pO max = pip 2 , where pi and p 2 are different prime 
ideals related by p 2 = Pi . Here we put 

Pi = {or. a e pi} 

and x 1 — > x is the non-trivial automorphism of K/Q. 

(ii) If Xk(p) = —1 then pO max is a prime ideal. 

(iii) If Xk(p) = 0, namely if p is a prime factor of Dk, then 

P^Amax — P” 


for a prime ideal p. 

There are no prime ideals of O max other than those which appear in (1), (2) and (3) 
above. 

By this theorem, we have a relation between zeta functions as follows. For a 
quadratic field K , we define the Dedekind zeta function of K by 

Here the sum runs over all non-zero ideals of D max and it converges absolutely for 
Refs) > 1 . The existence and uniqueness of the prime ideal factorization gives the 
Euler product decomposition 


k (5) 

again for Re(y) > 1. Here the product runs over all prime ideals of D max . Also the 
decomposition of primes of Z in K (Theorem 10.1) gives 

= £( s)L(s,Xk )■ 

We shall give a direct proof of this relation later. 

We also give the following theorem on character, again without proof. 

Proposition 10.2. Let K and Dk be as before. Let y he a primitive Dirichlet 
character modulo Dk and we consider the following property on y: 

X(N(a)) = 1 for any ideal a ofD max which is coprime to Dk ■ 
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(1) The Dirichlet character XkW = (— ) is a primitive Dirichlet character 
modulo | D\ 1 1 and satisfies the above property. 

(2) Conversely, any non-trivial primitive Dirichlet character modulo \ D k\ which 
satisfies the above property is equal to Xk- 

Now in the rest of this section, we explain a multiplicative structure of ideals 
including the case of non-maximal orders. 

We fix an integer / > 1 and we consider ideals of an order Dp of a quadratic 
field K = Q (*/m). We extend the notion of ideals so that we can consider a 
group of ideals. For this purpose, we introduce O /-modules, called fractional ideals, 
which are not necessarily contained in Dp. To define this, we first consider a free 
Z-submodule L of the quadratic field K. The rank of L as a free Z-module is at 
most 2. Indeed, if we assume that rankzL > 3, then there are elements to i, coi, 
coi € L which are the part of a basis over Z. If citoi + C 2 CO 2 + C 3 CO 2 = 0 for some 
Ci € Q, then there is some non-zero re Z such that re,- e Z for all i, so we have 
rej = 0 and hence c, = 0 for i = 1, 2, 3. Since L C K and K is of dimension two 
as a vector space over Q, this is a contradiction. 

Definition 10.3. A free Z-submodule of rank 2 in the quadratic field K = Q (*/m) 
is called a lattice of K. 

For lattices L\ and L2, we define their product by 


/. 1 L 2 = ! a i b , ; a, e L\, bj e L2, v is an arbitrary natural number 

This product is a lattice. Indeed, for any element a G K, there exists re Z such 
that ra e Z + Z and applying this to a basis of lattices L\ and L2 , we see 
that rL\, rLi C Z + Z Jin for some integer r. But since Z + Zjm is a ring, we 
have r 2 L\L 2 C Z + Z «Jm. Hence by virtue of the structure theorem for finitely 
generated abelian groups, L\ L 2 is a free Z-module of rank 2 at most. It is obvious 
that its rank is 2, because for any 0 ^ to G L\, we have 00L2 C L\ L2 and (0L2 is 
of rank 2. This multiplication is commutative and associative. 

Generalizing slightly the notion of a proper ideal, we call a lattice L of K a 
fractional ideal of Dp if L is an D /-module. Then we also have DpL = LD / = L, 
since 1 G Dp. So 0/ is an identity element of the semi-group of fractional ideals 
of Dp. In this book, if we say just an ideal of Dp, we always understand that it is a 
subset of D f . 

Now, in order to introduce a group structure on fractional ideals of an order D / , 
we need inverse elements. In other words, for an ideal a, we would like to find a 
lattice L such that aL = Dp. But such lattice does not exist in general. 

Example 10.4. Take K = Q( \f—2) and consider the order O5 = Z + 5 V— 2Z of 
conductor 5. Put a = 5Z + 5\f—T/. Then this is an ideal of O5, but there exists no 
lattice L such that aL = D5. The reason is as follows. If such an L exists, then since 
5 e a, we have 5 L C D 5 . So any element a of L is written as a = c//5 + £>V— 2 
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(a, b € Z), but (5 + 5\/—2)a = (a — 10b) + (a + 5b)V—2 and since this should 
belong to D 5 , we have a + 5b e 5Z, so a e 5Z. So we get L C Z + Z\/^2, 
therefore we have a L C 5(Z + Zv^2) f. Q 5 . Notice in this example that a is not 
a proper ideal. 

Lemma 10.5. Let a be an O / ideal. There exists the fractional ideal L of O / such 
that 0 L = D f if and only if a is a proper D / ideal. If a is proper. L is uniquely 
determined by a. 

Proof. First assume that a is proper and we construct L such that aL = Of 
explicitly. It is sufficient to give it when a is a primitive ideal. Let { 1 , cu} be the 
standard basis of the maximal order of K and express the ideal as a = Z a + Z (d + 
fa > ) in the standard basis. We denote by x i->- x the non-trivial automorphism of 
K/Q , and put M = Z + Z a~ x (d + fed) = a -1 a. This is obviously an Of module 
and hence a fractional ideal of Of. Taking the product, we have 

aM = Za T - Z (d + f od) + Z (d + f co) -F Zn ' ( d -f- f co){d + f (d). 

Here, because a is an ideal, c := a^ x (d + fco)(d + fed) is an integer (p. 80 , 
Lemma 6.3). Moreover, by the assumption that a is a proper ideal, a, b := 2d + 
fTr(co), c are mutually prime (p. 81 , Lemma 6.5). But by the above expression we 
have a, b, c e aM , so 1 e aM . Therefore, by d + fco e aM , we get ftx> e aM , 
and we see £)/ C n M . By the above expression, we have aM C Of, so we have 
aM = O f and we proved the first half. Conversely, let a be an D / ideal such that 
there exists L with aL = O f. Then if an C a for a e K, then aO / = aaL C 
aL = Of, so a e Of. So a is a proper O / ideal. If aM = aN = O / for fractional 
D / ideals M, N, then M = MO / = M(aN) = (Ma)N = O fN = N, so the 
uniqueness follows. □ 

We denote the O /-module M defined above by a " 1 and call it the inverse ideal 
of a. 

We call a fractional O / ideal L a proper fractional O /-ideal when 
{A € K-.XL c L} = Of. 

For any proper ideal a of Of, a -1 is also a proper fractional ideal. For any proper 
fractional ideal L, we have rL C O f for some integer r, and rL is obviously a 
proper ideal of Of, so L has also an inverse M , which is a fractional D / ideal such 
that LM = O f. 

Lemma 10.6. The product L\Li of proper fractional ideals L\ and L 2 ofO / is a 
proper fractional ideal. 

Proof. It is sufficient to prove when L\ and L 2 are (integral) ideals. Let a and b 
be proper ideals and put ab = c. For a e K, assume that ac C c. Since a is a 
proper ideal, there exists a -1 and we have a -1 c = a ~ 1 ab = b. Hence we have 
ab = a _ 1 (ac) C a~'c = b and since b is a proper ideal, we have aeO/. □ 
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Summarizing the above, we have 

Proposition 10.7. The set of all the proper fractional ideals of O f is a group by 
the product defined above with the unit element O /. 

The group defined in this way is called an ideal group of Of. The equivalence 
of ideals is defined for fractional ideals in the same way as the usual ideals, namely 
L ~ L' if L = aL' for some a G K x , and the quotient group of fractional ideals 
divided by this equivalence is called an ideal class group (in the wide sense). The 
class number of Of is the cardinality of the ideal class group of Of. 

Now we give a certain subset of ideals which, while not including all invertible 
(i.e. proper) ideals, does include representatives for the whole ideal class group 
of £) /. 

Definition 10.8. Let m be a natural number. We say that an ideal a of 0/ is coprime 
to m if 


a + mO f = Of. 

Lemma 10.9. An ideal a of O f is coprime to m if and only if the norm of a is 
coprime to m. Any ideal coprime to f is a proper ideal. 

Proof. We may assume that o = lb for a primitive ideal b. By using the standard 
basis, we write b = Za + Z (d + fed). The module a + mO f is obviously an ideal 
of Of, so a is coprime to m if and only if 1 G a + mO /, or equivalently there exist 
x, y, z,w € Z such that l(xa + y(d + fed)) + m(z + wftd) = 1. This implies that 
Ixa + l yd + mz = 1 and lyf + mw f = 0. If there exist such x, y, z, w, then by 
the second equation, we see that ly is divisible by in and hence by the first equation 
we see (m.la) = 1. Since N(a) = al 2 , we have ( m,N(a )) = 1. Conversely if 
we assume that (m. la) = 1, then there exist x, z G Z such that lax + mz = 1. 
Since lax + mz G a + mO f, this ideal is equal to Of and we have proved the first 
assertion. Since / O max C O /, for any ideal a of Of we have 

a+fO/ C a+ f O max c Of . 

So if we assume that a is coprime to / , we have a + f O max = Of. If a a C a, then 
a G O max , so we have 

otOf =D' 0 + f otO max C a + f O max = Of. 

Hence a, G O / and a is a proper ideal. □ 

There exists a proper ideal of D f which is not coprime to /. For example o = 
25Z + 5 V— 2Z C D,- = Z + 5 V— 2Z is not coprime to 5 but is a proper ideal of O 5 . 

In order to consider classes of proper D / ideals, it is enough to consider ideals 
coprime to / by the following lemma. 
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Lemma 10.10. Let rn be an arbitrary natural number. For any proper ideal n of 
D /, there exists a proper ideal of O / which is equivalent to a and coprime to m. 

Proof. It is enough to prove it when a is a proper primitive ideal. We write a 
primitive quadratic form corresponding to the class of a by 

( a b/2\ 

(b /2 c )■ 

It is enough to see that we can make a prime to m by changing the quadratic form to 
an equivalent quadratic form. Let S be the set of all primes which divide m and put 


51 = {p e S;p \ a}, 

5 2 = {p e S\ p\a and p \ c}, 

5 3 = {p e S\ p\a and p\c}. 

Put x = rUs 2 P an d y = U peSl P- Then we have (x, y) = 1, and if we put 
/ := ax 2 + bxy + cy 2 , then l is coprime to m. Indeed, if p e Si then ax 2 is 
coprime to p , but p\(bxy + cy 2 ), so p \ l, and if p e S 2 then p\(ax 2 + bxy) but 
p \ cy 2 , so p \ I .If p & S 3 , then since we assumed first that a, b, c are coprime, we 
have p \ b, and since p \ xy, again / is coprime to p. Taking a matrix A e S7,2(Z) 
whose first row is ( x , y), we have 


A 


( a b/ 2 \ 
U/2 c ) 


‘A = 



so we are done. □ 

We denote by 7o(£> /, /) the set of ideals of Of which are coprime to / and by 
To (O maJC , /) the set of ideals of D max which are coprime to /. 

Lemma 10.11. The mapping cp '■ Io{O max , f) — > Io(D f, f) defined by (f>{a) = 
a IT Of is a bijection. This maps a product to a product and does not change the 
norm. 

Proof. First we show that this mapping does not change the norm. Let a be an ideal 
of D max coprime to /. By the isomorphism theorem of rings, we have O / /a IT 
0/ = (Of + a) /a. Since we assumed here that a is coprime to /, we have a + 
fO max = O max , but since fO max cO/,we also have Of + a = O max . Namely 
we get 


\O f /an Of | = \O max /a\, 

and this means that the norms are equal. In particular, we have seen that the 
image of the mapping is contained in Iq( 0 f, f) since the norm is coprime to /. 
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The surjectivity of the mapping is proved as follows. For ao e Io(0 /, /), the O max 
ideal aoO max is coprime to /, since 1 e 0/ = Oo+ /D/C QoD,„ ax + f O max . 
We have 


a 0 C a 0 D max n 0/ = (aoD max n D/)(a 0 + f Of) 

C a 0 + a 0 /D max C cio + aoD / — ao- 

So we have 0(aoD max ) = ao- Injectivity is proved as follows. If we write 0(a) = cto 
forafixeda e /(/ O max ), then as we have shown above, we have cp(aoO max ) = ao- 
We also have N(a 0 O max ) = N{a 0 ) = A/a). Since a 0 D maA - C (a Cl O f )O max C a 
and N(a) = N(aoO max ), we have o = aoD max . This means that a is determined by 
cio = 0(a). So the mapping is bijective. Now take ideals a, b e Io(O max , /)• We put 
ao = 0(a) = a n D/ and bo = 0(b) = b fl D/. We will show that 0(ab) = aobo- 
First we show that aobo is coprime to /. Since ao is coprime to / as shown before, 
we have ao + /D/ = O/, so aobo + /bo = bo- Since bo is also coprime to /, 
we have aobo + /bo + /D/ = bo + /D/ = 0/. Since /bo C /O/, we have 
do&o + /b + / D / = aobo + /O/. So we have aobo + /D/ = Of. So aobo is 
coprime to /. Now we put Co = aobo- We will show that Co = ab fl D/. We have 
already shown that a = <XoD max , b = boD maA and Co O max IT D/ = Co- We have 

a 0 b 0 D 

max — a 0 D 

max b 0 D 

max — ab, 

so for ab e Iq (/ O max ), we have 0(ab) = aobo- Therefore a product is mapped to 
a product. □ 

So in order to consider the classes of proper D / ideals, we can lift proper D / 
ideals prime to / in each class to ideals of O max and consider these. But this does 
not mean that the classes of O max ideals and of D/ ideals are the same. In fact, for 
a principal ideal aO max , the ideal 0/ fl aO max is not necessarily a principal ideal 
of Of and this causes the difference. Also we should note the following point. 
We denote by 7(D/,/) the group of fractional proper Of ideals generated by 
7o(D /, /). Then a principal ideal aD / (a. e K) is contained in 1(0 /, /) if and 
only if we can write a = /3/y by numbers /, y e Of which are prime to /. 
Here, we may take y as a rational integer prime to / since a = j J >y / N(y). We 
denote the set of such principal ideals by P(0 /, /). In the case when a e K is 
not in D /, even if (the numerator and the denominator of) N(a) is prime to /, we 
cannot conclude that aD / is in P(0 f, /). Now, obviously an element e Of 
is prime to / if and only if /I — a e / O max for some integer a such that a is 
prime to /.So if we denote by Pz(O max , f) the group generated by principal 
ideals P0 max , where / e O max and / — a e f O max for some integer a prime 
to /, then P(0 f, f) is lifted to this group, and we see that the ideal class group 
of Of is isomorphic to I(O max , f )/Pz(O max , /). If we denote by P(O max , f ) 
the group generated by integral principal O max ideals prime to /, then by counting 
P(O max , f)/Pz(O max , /), we can calculate the difference between h(f 2 DK) and 
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This calculation is not so difficult and is explained for example in [101] or 
[67], but we omit it here. In next section, we use a different method to obtain this 
difference in the case of imaginary quadratic fields. 


10.2 Proof of the Class Number Formula of Imaginary 
Quadratic Fields 

We prove that the class number of an imaginary quadratic field can be written in 
terms of a generalized Bernoulli number. There are many ways to prove this; here 
we use properties of theta functions. 

Theorem 10.12. Let K be an imaginary quadratic field , Xk be the character 
corresponding with K defined in Sect. 6.3 and w be the half of the number of roots 
of unity in the maximal order O max of K. Then the class number ofDx is given by 

h(D K ) = -wB hXK . 

For the proof, we review the transformation formula of theta functions. Let a, b, 
c be integers, and we assume that the quadratic form Q (x, y) = ax 2 + bxy + cy 2 is 
positive definite. We define Q~ l by Q~ l (x,y ) = 4(4 ac — b 2 )~ 1 (cx 2 —bxy +ay 2 ). 
We put 


ft = [r e C; the imaginary part of x is positive} 
(the upper half plane). For r e ft, we write 

0q(t) = e niQ(x ’ y)z . 

x,y€. Z 


It is easy to see that this converges absolutely and uniformly on compact sets in f), 
and is a holomorphic function on fj . 

Proposition 10.13. For ref), the following transformation formula holds. 


6q-1 



V 4ac — b 2 (r/2 i) 6 q{ r). 


Proof. A general proof can be found for example in [3, p. 25]. Here we give a direct 
proof restricted to this case. We use the Poisson 2 summation formula which is well 


2 Simeon Denis Poisson (bom on June 21, 1781 in Pithiviers, France — died on April 25, 1840 in 
Sceaux, France). 
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known in the theory of Fourier 3 transforms. To simplify the notation, for arbitrary 
complex number z G C, we write e(~) = e 2nn . For a > 0, r e f) and £, /* e R, the 
following formula is well known: 

* 2^ 


Here as the square root of —i ta (whose real part is positive since r e Jj), we take 
the branch s/a > 0 for r = i . When we put 

/(£.»?) = e(2~'Q(/r))T), 


s 

J — c 


e(2-'ar^)e(-^r)^? 


e(— (2ro!) 


v= 


■ira 


the Fourier transform of this function with respect to the variables (£, r\) e R 2 is 
calculated as follows: 


/(fW) 



f(/ /])e(-££* - r]if]*)d^dr] 

e(2 -1 r (a(/ + ( b/2a)rj ) 2 + (c - b 2 /4a)r] 2 ))e(-^* - rjr]*)d^dr ] 

e(2 -1 r (a£ 2 + (c - Zr/4a)t7 2 ))e(-i;£* - r](r)* - {b/2a)%*))d% dr] 

1 


s/^-ira sf^/c/c^tr/Aa) 

xe(— (2ra) -1 ^* 2 )e(— (2r(c - Zt 2 /4a)) _1 (jj* - (b/2a)%*) 2 ) 


2 i 


x e 


(ar,* 2 -br,*^ + cf* 2 ) 


* 2 


7(4 ac - Zt 2 ) V 2.x (ac — b 2 /A) 


-)■ 


By this result and the Poisson summation formula (e.g. [102]), we get 

|,i;6Z 

Hence we get the transformation formula of theta functions. □ 

Proof of the Class Number Formula. We show this by calculating the residue 
of a zeta function in two different ways. Namely, we can derive the formula by the 
relation %k(s) = £(s)L(s, Xk ) and the following calculation: 


3 Jean Baptiste Joseph Fourier (bom on March 21, 1768 in Auxerre, France — died on May 16, 1830 
in Paris, France). 
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lim(s - = 

S ~+ 1 


xh(P K ) 

Wy/\D K \ 


and 


L{\,Xk) = 


g(x)*i p _ B Uj[k tt 
\Dk\ Uk S\ ~d7\’ 


lim(j-l)£(j) = 1. 

S~+l 


The latter two equalities are in Theorems 5.4 and 9.6. The point here is the 
evaluation of 1 im s _>. | (,y — 1 ) £> (s) . We fix an ideal class of Q max and denote by C the 
set of all (integral) ideals of O max belonging to this class. We define the following 
zeta function which is a part of I,k(s)'. 


S(s,C) 


E 

o ec 


1 

N(a) s ' 


Lemma 10.14. Independently of the choice ofC, we have 


lim (s — l)£(.s, C ) = 

s—>\ 


n 

Wy/\D K \ 


Proof. If we take a fractional ideal L which is equivalent to some ideal in C , then 
all the other ideals in C are written as La. (a e K). Since La C we have 

a e L~*. We can choose L so that L _1 = a C O max by dividing L by some 
rational number if necessary. Here we can assume that a is a primitive ideal without 
loss of generality. Also we have La = Lf> if and only if a = eft for some unit e of 
ID max- Since we assumed that K is imaginary, 6 is a root of unity in K. Hence we 
have 


f(*,C) = 


N(aY 1 

2 w N(a) s 

a€a v 7 


We write a = Za + Z(d + co) using the standard basis and put N(d + a>) = ac , 
b = 2d + Tr(co) as usual. Then for a = xa + y(d + co) we have N(a) = 
a(ax 2 + bxy + cy 2 ). Since A' - (a) = a, we can cancel a s in the numerator and the 
denominator. So we have 




1 

2w 


y, (ax 2 + bxy + cy 2 ) s . 

x,yeZ; (x,y&( 0,0) 


We have also Dac — b 2 = D k \ . Now our aim is to calculate the residue of this 
function at s = 1. We use theta functions. Since i)(s. C ) converges absolutely 


10.2 Proof of the Class Number Formula of Imaginary Quadratic Fields 


165 


uniformly on compact subsets of {s € C; Refs) > 1}, exchanging the summation 
and the integral, and also using the following well-known formula of the gamma 
function 


r<s) _ r e - t (t_Yd 1= r° 

n s J 0 e W t Jo 


e~ n, t s ~ l dt 


we get 


1 n OO 

r(s)S(s,C) = — Y e - (a ^ +b ^ +cr > 1)y y s ~' dy 

2 w Jo ■“ 

(f.i)) ez2 — (0,0) 

ji s r°° 

= 2 -I {0 Q (iy)-\)f- l dy. 


Here OqUv) — I is a rapidly decreasing function for y — > oo, so the integral 

/ OO 

(0Q(iy) - dy 

converges for all .v e C and is a holomorphic function on the whole complex 
.s-plane. On the other hand, if we take (0, 1) as the interval of integration, it might 
not converge around 0 when s is small. In order to deal with this problem, we use 
the transformation formula of theta functions. Namely, if we substitute y ~ 1 for y in 
the integral on the interval (0, 1 ), in the range Refs) > 1 we have 


f\do(iy)-l)y s - l dy 

JO 

/ oo 

(6 Q (iy~ 1 )- Oy~ s ~ l dy 


/ OO 

(0Q-i{iy)2yY\D K \~ l - l).y _I_1 


dy 


= J 2(0q-i (iy) — l)y Y\ d k\ 1 + 2 y s Y\ d k\ 1 - k 1 1 dy 


-J 


(0 Q -i(iy) - 1) 


: dy — 


Y\d7\ Y\d7\o-o 


i 

s ’ 


The integral in the last expression converges for arbitrary .v, so it is holomorphic 
on the whole complex plane, and the rest has a pole at .s' = 0 and s = 1 . So the 
residue of ^(.v. C ) at ,s = 1 is given by 

1 1 27T 7 r 

H(l)2w s/Dk Wy/\D K \ 


Hence the lemma is proved. 
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Therefore we have 


lim (s - 1 )£jr(s) = 

s— >1 


h(D K )it 

W\/\Dk\ 


Comparing this with the value of L(l, /k), we get the class number formula. □ 

Since £( s ) and L(s, y) have functional equations, C,k(s), which is a product of 
these two, also has a functional equation and it is easy to calculate its explicit form. 
But as a matter of fact, if we use the above integral representation, we can obtain the 
analytic continuation and the functional equation of £(s, C), and through this we 
can get an alternative proof of the analytic continuation and the functional equation 
of £jr(.s). Namely: 

Proposition 10.15. Let C be an ideal class of the maximal order O max of a 
imaginary quadratic field K, and let the other notation be the same as above. 
Then £( s,C ) can be continued analytically to a meromorphic function on the 
whole s -plane. If we put %(s,C) = (2 tx)~ s \ Dk\ s ^ 2 T(s)^(s.C), then it satisfies 
the following functional equation. 


£U -s,C) = tts*C). 


In the same way, if we put %k(s) = (27 r) s \ D k\ s T (s)^k(s), then we have 


£*(!-•*) = Zk(s). 


Proof. For the proof, we use the integral representation mentioned before. For 
Re(.s) > 1, we have 


(2w)n~ s r(s)^(s,C) 


r oo 2 rOO 

), Wy)-V/-'Jy+ 7t =j/ («<»-'(■» 

2 1 

vTDjti(i-j) 5 


1) y~ s dy 
( 10 . 1 ) 


but the integral on the right-hand side is holomorphic on the whole .v-plane. Flence 
the right-hand side is a meromorphic function on the whole complex ,s-plane. So the 
analytic continuation is proved. 

The functional equation is shown as follows. Multiplying both sides of the 
integral representation before by 2~ s \ D k \ s ^ 2 and changing s — > 1 — 5 , we get 


(2w)^(l-s,C) 

= (2w) \D k \ ( !~'0/ 2 (2jt ) s_1 r ( 1 - s)U 1 - .S', C) 

/ oo 

(0 Q (iy)-l)y- s dy 
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/ oo 

(d Q -i(iy)-l)y s - 1 dy 

2 s \D k \ \Dk\~ 1 ' 2 2 s ~'\ D k | (1 - s) / 2 

5 1—5 

= r\D K r ' 2 J™(6 Q (iy) - l)y~ s dy 

s— 1 




1 2 — 1 1 Z> *' 1 1/2 ' 


+ 1 {0 Q -i(iy)-l)y s 1 dy - j— ^ — J. (10.2) 


Here the integrals in the parentheses are almost identical to the integral expres- 
sion (10.1) of £( 5 , C) if we exchange Q and Q~ l . We defined 

-1 4 , . 

Q (wj) = T 7T7( cx ~ bx y + °y )- 

\ d k\ 

and the quadratic forms ax 2 + bxy +cy 2 and cx 2 — bxy + ay 2 are equivalent since 
it is the same if we replace (x, y) — > (— y, x). Since we have 

(Q~'(x, >>))“' = 2~ 2s \D k \ s (cx 2 - bxy + ay 2 )~ s 

we see in the same way as before that 

pOO 

(2 w)r(s)£(s, C) = 2 2s 7t s \D K \- s / (0 Q -i(iy) - l)/" 1 dy. 

Jo 

Just in the same way as in the proof of (10.1), we have 

p OO poo 

J (0Q-i(iy) - Oy s ~ l dy = J (d Q -d.iy) - \)y s ~'dy 

2 -‘vTd 


+ 2 - 


/ OO 2 ~ 1 

(9o(iy) - 1 )y~ s dy y 


k\ _ 1 

5 


(If we note that the discriminant of Q 1 is 1 6 1 /) *- 1 1 , we see the symmetry to the 
case Q.) Therefore comparing this with (10.2), we have 

poo 

(2w)£(l -s,C) = 2 s \D k \ ~ s / 2 / (0 G -i (,» - l)/" 1 dy 

Jo 

p oo 

= (2jt)- s \D K \ s / 2 2 2s \D K r7T s (e Q -diy)-\)y s -' dy 

= ( 2 w)( 27 t)- s \D K \ s/ 2 r(s)i;(s,C) = (2 wMs.C). 
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Hence the proposition is proved. The functional equation of (s) is obvious from 
this. □ 

As for the formula for the class number h(f 2 D[c ) of the ideal class group of a 
general order Of, it is easiest to compare it with the ideal class group of O max by 
a group theoretic method. (For example, see Weber [101, p. 366], Lang [67, p. 95], 
Zagier [106].) But here we introduce a different method. 

We first give the formula for the zeta functions of Of ideals for general / > 1 . 
Here we do not assume that K is imaginary, since the theory is the same for real 
quadratic fields. 

We fix a natural number / and consider the zeta function of ideals of the order 
Of = Z + Z fco of K, which is defined by 


S{s,f 2 D K ) 


E 


i 

N(a) s ’ 


where the sum runs over arbitrary ideals of Of, which are not necessarily proper. 

Proposition 10.16. (1) For a positive integer a, let r(a ) be a number of residue 
classes modulo 2 a represented by integers b such that b 2 = f 2 Dic mod 4 a. 
Then we have 


OO 

Z(s,f 2 D K ) = l;(2s)J2r( a )a~ s - 

a= 1 


(2) More precisely, we have the next relation. 


£0, f 2 D K ) = S(s)L(s,Xk) PI 


P\f 

P m \\f 


\_ p (m+m-2s)_ XK ^ p -s ^ _ p m(l-2s)j 
1 — p l ~ 2s 


Here we denote by m the highest power of p dividing f (hence m depends 
on p). 

Proof A primitive ideal of Of is written as n = Za + Z (d + fco) (a = N(a)) 
and there exists an integer c such that N(d + fco) = ac, so if we put b = 2d + 
fTr(co) then b 2 = f 2 DK mod 4 a (p. 79, Lemma 6.2 and p. 80, Lemma 6.3). For 
positive integers a, a' , we have Z a + Z (d + fco) = Z a 1 + Z (d' + fco) if and 
only if a = a' and d = d' mod a. So if we fix a, two primitive ideals which have 
the above bases are equal if and only if b = b' mod 2 a for b = 2d + fTr(co) 
and b' = 2d' + fTr(co). Besides, if b 2 = / 2 Hat mod 4 a for some integer b, 
then since Tr(co) 2 — Dk = 4 N{co), we have b 2 = ( fTr(co )) 2 mod 4 and there 
exists an integer d such that b = 2d + fTr(co). So we can define a primitive 
ideal Z a + (d + f cofL. Taking ideals which are not primitive into account, (1) is 
proved. Next we show (2). For this purpose, we should just calculate r(a). Here 
we introduce a local version of r(a). For a positive integer a and a fixed prime p. 
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we denote by e the maximum non- negative integer such that p e \a. If p is odd, we 
define r p (a) to be the number of integers b mod p e such that b 2 = f 1 D k mod p e . 
If p = 2, we define r p (a) = r 2 (a) to be the number of integers b mod 2 e+1 such 
that b 2 = f 2 D K mod 2 e+1 . Obviously we have r p (a) = r p (p e ). By the Chinese 
remainder theorem, we have r(a) = n p | a r p ( a )• So we have r(a)a~ s = 

n r p(P e ) P~ es and ^ i s enough to calculate the above local Dirichlet series 

for each prime p. First we assume that p is an odd prime. Let / = p m fo, where 
/o is a positive integer prime to p. We consider the congruence equation b 2 = 
p 2m fo b>K mod //' with unknown h. If e < 2m then b 2 = 0 mod p e . Hence if 
e = 2eo here, then b = 0 mod p e ° so r p (p e ) = p e / p e ° = p e °. If e = 2eo + 1, 
then b = 0 mod p eo+1 , and r p (p e ) = p e / p e ° +1 = p e ° . So we have r p (p e ) = p e ° 
in both cases. Now we assume that 2m < e. Then we have b = p m bo and b% = 
Dk/q mod p e ~ 2m . If p\Dx then p divides Dk precisely once, so if e — 2m > 2, 
there is no such bo. Hence it can have solutions only when e — 2m = 1, and in this 
case, the congruence implies nothing but p\bo, so p" ,+ 1 1 b is the condition. But we 
are counting b mod p 2 ' n+1 , so we have r p (p 2,,,+1 ) = p m , and we have r p (p e ) = 0 
for e > 2m + 1. If p \ Dk, then the congruence equation b ^ = Dk / 0 2 mod p e ~ 2 "' 
has solutions only when Xk(p) = 1 and the number of solutions mod p e ~ 2m is two 
in that case. Since we must count these as residues mod p e ~ m , the number is 2 p'” . 
After all, if p \ Dk ande > 2m, we have r p (p e ) = p m (\ + Xk{p)), independently 
of e. Summing up, we have the following formula for r p (p e ). 


r p(p e ) 


p^ e ^ 2 ^ 0 < e < 2m, 

(1 + Xk(p))p"’ 27 n + 1 < e and p \ D K , 

p m e = 2m + 1 and p\Dk, 

0 2m + 2 < e and p\Dk- 


(10.3) 


Here [e/2] denotes the biggest integer which does not exceed el 2, that is, we put 
[e/2] = eo if e = 2eo or 2eo + 1 for some integer eo- Also when p = 2, we 
obtain the same formula (10.3) by similar consideration. For example, assume that 
0 < e < 2m. If e = 2eo is even, then b 2 = 2 lm / qDk mod 2 e+1 means that 
b = 2 eo b 0 and /)/ = fj D k mod 4. This last relation imposes the condition that 
bo is even or odd. So the number of solutions b mod 2 e+1 is 2 eo+1 /2 = 2 e °. If 
e = 2eo + 1 is odd, then 2eo + 2 < 2m, so b = 2 e ° +1 bo- Again the relation b^ = 
D k mod 2 tells that bo is odd or even. So the number of solutions b mod 2 2eo+1 
is 2 2eo+2 ~ (eo+1> /2 = 2 e °. The case e > 2m + 1 is calculated similarly. Here we 
note that if Dk is odd, then xk( 2) = 1 if Dk = 1 mod 8 and xk( 2) = —1 if 
Dk = 5 mod 8, and that the congruence equation x 2 = y mod 2 ,+1 with I > 1 
for a fixed y and unknown x has no solution when _y = 5 mod 8 but two solutions 
x mod 2 l+l when y = 1 mod 8. We should also note that, by definition, if Dk is 
even then Dk = 0 mod 4, and if Dk = 4 mod 8 then Dk / 4 = 3 mod 4. The rest 
is the same and is omitted here. By using (10.3), we see 
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J2 r p(p e )p~ es = 

e=0 


(1 + p~ s ) - (1 + p>-')p('- 2 >Km+\) 

1 — p l ~ ls 


+ 


ptn(\—2s)—s 

(1 + XK(P))p m(1 ~ 2s) - S 


1 — p s 


if p\D K , 
if P \ D k . 


The last term written for each case p\D k or p \ Dk separately has a unified 
expression 


(1 + Xk(p))P MI ~ 2s) - s 
1 - Xk(p)P~ s 

Reducing to a common denominator, the right-hand side becomes 

(1 + p~ s )(\ - p ( m +W-2s) _ XK (p)p~ s ( 1 - p m( '- ls) )) 

(1 ~Xk(p)p~ s )(1 ~ P l ~ 2s ) 

Hence multiplying this by the Euler -factor of £(2 s), we get 


e=0 


1 _ p (m+l)(l-2s) _ XK (p)p- s (\ - pM\-2s)^ 

(1 - /r^Ml - Xk(p)p~ s )( 1 - p'~ 2s ) 


□ 


Next we put 


But this time, we define the above sum so that a runs only over proper ideals of D f . 

Here we introduce the Mobius 4 function //(/«). This is a function of natural 
numbers to {0, ±1}, defined as follows: 

( 1 ) / 2 ( 1 ) = 1 . 

(2) When the prime factor decomposition is given by m = p\ ■ ■ ■ p r with different 
r primes, we define jMm) = (— l) r . 

(3) When m has a square factor, that is, if m is divisible by a square of a prime, we 
define /i(m) = 0. 


4 August Ferdinand Mobius (born on November 17, 1790 in Schulpforta, Saxony (now Germany) — 

died on September 26, 1868 in Leipzig, Germany). 
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If m i and m 2 are coprime natural numbers, then we have [i(m\)p.(m 2 ) = /x (m i/?? 2 ) - 
The next formula is known as the Mobius inversion formula. 

Lemma 10.17. Let F(n ) and G(n) be functions defined for all natural numbers n. 
We assume that, for an arbitrary natural number n, the following equality holds. 

F(n) = Y J G(d). 

d\n 


Then for an arbitrary natural number n, we have 

G(n) = (^) Fi - d) - 

d\n 

The proof is easy and is omitted here (see [83]). 

Proposition 10.18. (1) Using the Mobius function pt, we have 


^/ 2 d k ) = 

e\f 

i;(s,D f ) = f- s ^2n(f/d)d s t;(s,d 2 D K ). 
d\f 


(2) The next relation holds. 

S(s,8f) = £( s)L(s,xk ) 

r-r (1 - p~ J )(l ~ Xk(p)P~ s ) ~ p m ~ l - 2sm ( 1 - p l ~ s ){x K (p) - P X ~ S ) 

^11 1 yl— 25 

p\f y 

P m \\f 

Proof. The first equality of ( 1 ) comes from the facts that any ideal of D/ is a 
proper ideal of D,. for some e\f and that if the ideal is regarded as an ideal of 
O e , then the norm becomes / /e times the original one. The second equality follows 
from the Mobius inversion formula, applying it to F(f) = ffiis, f 2 Dk) and 
G{e) = efi(s. Of). In order to show (2), we calculate the right-hand side of 
the second equality of (1) by using Proposition 10.16. Since the Mobius function 
is multiplicative, it is enough to calculate the Euler ^-factors. Here when / = 
n i=1 p" h » by the definition of the Mobius function, we can assume that d runs only 
over the numbers J~[ p" 1 ' 1 with j, = 0, 1. So if we write 


! _ ^(m+Dd-2,) _ ( } ? m(l-20) 

f( m , P ) = 


1 — p l 2s 
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then we have 

Z(s,Of) = %( s )L(s, xk) J - [ (/(wt, p~ s ) — p~ s f(m - l,p~ s )). 

p\f 

Calculating the latter product, we get the result. □ 

Remark 10.19. The explicit expression of the zeta function in Propositions 10.16 
and 10.18 was given in [54, 106], (See these papers for applications.) 

Now, when K is an imaginary quadratic field, we obtain the class number of D / 
from the above formula for 0 Os, O /). As before, we denote by C an ideal class of 
proper ideals of D/ and write 




E 

a SC 


1 

may' 


Let vi ’ f be half the number of roots of unity in Of. For a primitive quadratic form 
Q(x, y) = ax 2 +bxy + cy 2 with discriminant f 2 DK corresponding to C, we have 

(2w/)£ (s, C, Of) = ^ (ax 2 + bxy + cy 2 )~ s . 

Ot,>>)sz 2 -(0,o) 


This is proved in the same way as in the case of the maximal order. If we use the 
transformation formula of 0q(t), the residue at ,v = 1 of this Dirichlet series is 
obtained in the same way as before, independently of C , and given by 


lim(.y — l)£(s, C, D /) 

S-+ 1 


71 

Wff V\Dk\ 


By this, we have 


lim (s — 1)C(5,D/) = h(f 2 D K ) — - ■ 

S ~* 1 Wffy/\D K \ 

On the other hand, using the Euler product expression in the formula for £(s,D /■) 
in Proposition 10.18, we have 


lim Os - lK(,s,L)/) = dim Os - 1 )£*($)) x F[ (1 “ Xk(p)p ') 

s—> 1 s — ► 1 A A 

p\f 


Tch(P K ) 
Wy/\D K \ 


nd _ xk(p)p ')■ 

p\f 
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Hence we get the class number formula 


h(f 2 D K ) = h ^ K) x / ]""[(1 - Xk(p)P *) 

vi; / vt; s x A 


as was given in Chap. 6, Theorem 6.12 (2). 

The class number formula for a real quadratic field K (or for indefinite quadratic 
forms) is fairly complicated compared with this, and it is remote from our standpoint 
from Bernoulli numbers, so we omit it here. 


10.3 Some L-Functions Associated with Quadratic Forms 

We introduce here one of the simplest example of zeta functions associated with so- 
called prehomogeneous vector spaces. The content of this section is based on [47], 
Let p be a prime. We assume that p = 3 mod 4. We define a Dirichlet character 
i j/ modulo pby \jf(a) = ( j'j. From this, we define a function xfr on the set L* of 
all 2 x 2 half-integral symmetric matrices as follows. First, for 



if T mod p = 0 or det(T) ^ 0 mod p, we define i jr(T) = 0. If T mod p ^ 0 and 
det(T) = 0 mod p, then there exists g G GL?(Z/ pZ) such that 



Indeed since the rank of T mod p is 1, there exists g such that the second column 
of Tg mod p is 0, and so is the second column of 1 gTg, but since this is symmetric, 
the (2, 1) component is also 0. We define \]/(T ) = \fr( e ) for such T. This is well 
defined since if we assume that 




e GLiiZ/ pZ ), then we have e' = x 2 e (and z = 0, 


fore, e' ^ 0 mod p and 


x g (z/pzr). 

We denote by L* + the set of all positive definite half-integral symmetric matrices 
in L* . Recall that T\ and 73 G l* are said to be equivalent if we have ' gT\g = 77 
for some g G S Lo(Z). We define an L-function of L* with character i Jr by 
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L(s,L* + ,f) 


E 

Te.L* + /~ 


t(T) 

e(T) det(T) 1 ' 


Here L*./ ~ denotes a complete set of representatives of L * up to the equivalence 
and e(T) the order of the finite group {g e SL 2 CL)', 1 gT g = T \ . (This L-function 
was defined first by K. Hashimoto in relation to dimension formulas of automorphic 
forms. This is also an example of L-functions of prehomogeneous vector spaces. 
As we see later, 1 fr can be regarded as a character of some ideal class group, so the 
term “L-function with character \fr ” is not strange.) 

Let us try to calculate an example. If T = ( ° <= L* and rank (7’ mod 

\b/2 c ) 

p ) = 1, then \f(T) = \f(a) if p \ a and \jr(T) = ijs(c) if p\a. Indeed if p\a, then 
by the condition on the rank, we have p\b and c e (Z/ p Z) x , and if p \ a, then we 
have 


/ 1 C)\ / a b/2 

\—b/2a 1 ) \b/2 c 


1 — b/2a\ _ la 0 \ _ f a ()\ 

0 1 )~ l 0c -^J = Vooj 


mod p. 


In the case p = 3, for det(2L) = 3, 12, 15, 24, 27, . . . , we have 



remaining T with rank (7" mod 3) = 1. From this we get 


L(s,L* + ,f) 



+ ... 


2 2s-i ' ! ' 2 2s ~ l 

3.V+1 35+1 + 33s 


Theorem 10.20. This L-function is essentially the Riemann zeta function. More 
precisely we have 


L(s.L* + ,xf) = - 


2 2s ~ l 


^1,1 (/ 


P s 
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In particular, for an arbitrary natural number m, we have 


L{ 1 — m, L* + , i/r) 


2 2 " 




Before proving this, we prove a lemma. For an imaginary quadratic field K, its 
discriminant Dk and a positive integer /, we put d = f 1 D k and 


T(d) = 



4 ac — b 2 = —d 


Here ]_ff pr,m j s the subset of L*_ consisting of primitive ones. We also put S(d) = 
V(d)/ 

Lemma 10.21. We assume that d(< 0) is divisible by p. 

(1) IfQ(Vd) ± Q (J=p), then £ $(T) = 0. 

TeS(d) 

(2) IfQ(Vd) = Q (y/—p), then f(T) = 1 for any T e V(d). 

Proof. We first see that f(T) is the value of if at the norm of the ideal correspond- 
ing to T if rank(T mod p) = 1. In this case, by definition, there exist integers x, 
y coprime with each other such that f(T) = f(e) for e = ax 2 + bxy + cy 2 . 
Since 4 ae = ( lax + by) 2 + (4 ac — b 2 )y 2 and since we assumed that 4 ac — b 2 
is divisible by p , we have i J/(ae) = i/r(2ax + by) 2 . By assumption, we have 
p \ e, and replacing by an equivalent one if necessary, we may assume that 
p \ a. Hence we have t/r(2 ax + by) 7 ^ 0 and f(2ax + by) 2 = 1, so we 
have 1 J/{a) = i//(e). This is the value at the norm. Therefore, since we have 
Xk = f if K = Q (Vd) = Q(V =p), we get f(T) = 1 by Proposition 10.2 
and Lemma 10.11. If K = Q (Vd) 7 ^ Q(^/—p). then xk 7 ^ it- So 1 fr(T) takes —1 
on some elements of S(d) again by Proposition 10.2. That is, 1 // is a character of 
the group S(d) which is not trivial, so the sum over the group becomes 0. □ 

Proof of Theorem. First, it is obvious that ir(eT) = ir(e)ir{T) for any integer e 
prime to p and any T € L* + , and since c(eT ) = e(T), we reduce the L-function to 
the sum over primitive quadratic forms. Namely if we put 


L prim (s,L* + ,f) = Y, 

T€L*/ rim p 


jt(T) 

e(T) det(r)*’ 


then we have 
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L(s,L* + ,f) 


E E 


TeL , 


f(e)f(T) 
e(eT) det(e7’) i 


e= 1 

= L(2s,f)L prim (s,L* + ,^). 


Here L(2s, \ fr) is the Dirichlet L-function. Next, if we apply the previous lemma to 
L + P " m , only the part for T with 4det(7’) = —d = f 2 p (/ are positive integers) 
remains alive. Since e(T) has a common value for all the elements in V(d ), we 
denote this by e(d). Then we have 


L prim (s,L* + ,ijs) = 2 2s 

/=t 


\S(d)\ 
p s f 2s e(d) 


By virtue of the relation between primitive quadratic forms and proper primitive 
ideals of an order of a quadratic field, |S(<i)| is the class number /? ( / 2 D k) of the 
order D / of the imaginary quadratic field K = Q( J— p ) . By the class number 
formula, we have 


L prim (s,L*,,f) = 2 : 


2 s 


h(P K ) 

I £>max\P' 


E/ 

/= i 


1— 2s 


n (i-V(?)). 

q\f ^ ^ 7 

q'.prime 


Here by Theorem 10.12, we have 

h(Dx) _ 

I &max\P S 2 P S ’ 


The above sum is calculated as 


E/ 1 "nO-V^E/ 1 2 * n (i+4(#(?)) 

/ = 1 q\f V q 7 /= 1 «!/ V q 7 

q'.prime 

oo 

= E ^ 1-2s E 

/ z— / m 


/=i 


i|/ 


If we put / = in n here, the above sum becomes 
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oo 

£ n x - 2s m- 2s f{m)n{m) = f(2 j - 1) f] 0 ~ iK?)? -2 *) 

m,/7=l q'.prime 


= £(2s — 1)L(2 j, i/r) 


Combining all the above considerations together, we get the expression of 
L(s,L*^_,\lr) in the theorem. As for special values, it is an immediate consequence 
of the fact that the values of the Riemann zeta function at integers not more than 0 
are given by £(1 — m) = —B m /m (p. 72, Theorem 5.4). □ 

The above relatively easy considerations gave us a chance to start thinking that 
zeta functions of prehomogeneous vector spaces consisting of symmetric matrices 
would be easy objects. As things turned out, we found out that zeta functions for the 
vector space of symmetric matrices of any degree are described by previously known 
standard objects such as the Mellin transform of Eisenstein series of half-integral 
weight and the Riemann zeta functions, and they have a decisive application for 
dimension formulas of automorphic forms. See [46,48,49] for these developments. 

Exercise 10.22. (1) Let Dk be the discriminant of a quadratic field K and p an 
odd prime such that Xk(P ) = (^r) = 1- Show that there exists b e Z such 
that b 2 = Dk mod 4 p. Show also that for such b. the module p = Z p + 
Z b+y/DK j s an jdeal 0 f £) max whose norm is p. 

(2) Assume Dk = 1 mod 8 and put p = Z2 + Z 1+ £^ -. Show that this is an ideal 
of D max and A(p) = 2. Also when Dk = 4 mod 8 (i.e. Dk = 12 mod 16), 
give a prime ideal p with N(p ) = 2. 

(3) Show that if a is anideal of D max and N(a) is prime to Dk, then xk(N(o)) = L 

Exercise 10.23. (1) Let K be a quadratic field and Dj is the order of conductor 
/. Let n and b are proper D j ideals which are prime to /. Show that if a = ba 
for some a G K x , then a = /3/y for some /3 e D / prime to / and y e Z 
prime to /. 

(2) We put K = Q( \/3), LL = Z + Z(2\/3) and a = 4+ y/3. Show that, although 
N(ce) = 13 is prime to 2, there is no pair y e O 2 prime to 2 such that 
a = P/y. 

Exercise 10.24. Let a be a proper D / ideal. We define the norm N(a ) of a by the 
cardinality of the set D// a. 

(1) If a = Za + 7j b+ ^ D with a > 0, then show that N(a) = a. 

(2) Assume that a = Z<z>i + Ztib- Then show that N(a) = \(co\C 02 — a>jtt)\)l \[D\. 

(3) Let a e D/. Then show that N(aa) = \ N(a)\N(a). 

Exercise 10.25. Let a = Za + Z b+ be a proper primitive D / ideal and put 


a = Za + 


— b + \f~D 


Z. 


2 
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( 1 ) Show that a is also a proper primitive D / ideal and that 

ad = aD f. 

(2) Denote by Q(x,y) a representative of quadratic form corresponding to the 
narrow ideal class containing a. Then show that the following conditions are 
equivalent. 

(i) Q(x, y) belongs to an ambig class. 

(ii) a 2 = aD / with N(a) > 0. 

Exercise 10.26. Let a = Za + Z h+ ^ D be a proper primitive 0/ ideal, where 
D = f 2 Dk for some fundamental discriminant D k ■ Assume that b = 0 mod a. 
Show that a 2 is a principal ideal aD /. 

Exercise 10.27 (Product of “einig” pair of ideals). Denote by D k the fundamen- 
tal discriminant of a quadratic field K. Fix a discriminant D = f 2 D k and consider 
two proper primitive D j ideals 


a, = Z a, + Z 


bi + \/~D 
2 


where a, > 0 (i = 1, 2). As is shown in Chap. 6, there exist integers c\ and 
Co such that D = b\ — 4a\Ci = b\ — 4a2C2, and we have gcd{a\,b\,C\) = 
gcd(a .2 , bo,C 2 ) = 1 • In this exercise, we will obtain the free basis of the product 


_ b2 + \[D b\ + \fD b\b2 + D + {bi + 62) VD 

Cli 0-2 — £jCI\C12 H - Z id\ b H - Z . 


We assume that gcd(a\,a 2 ,(b\ + ^2)/ 2) = 1. (Here we note that b\ = Z?2 = 2 
since b\ = D = b\ mod 4.) Such a pair of ideals is called einig by Dirichlet (see 
[29]). We fix integers x, y, z such that xci\ + ya 2 + z(b\ + £>2)/2 = 1. Define a 
rational number bo by the relation 


&2 + oJ~D b\ + \[D b\b2 + D + ( b\ + bT)oJ~D bo + \I~D 
xa\ 1- ya 2 h z 


b\ - b 2 

a = x — 1- c\z , 


a b 2~b\ 

P = y — ^ H c 2 Z, 


y = (c 2 x + ci y) 


(1) Show that bo is an integer. 

(2) We put 


2 
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Then, show the following equalities. 


( b\ + \f~D \ / bo + \J~D \ 

a 2 I I = a 2 I I + a x a 2 a , 

( b 2 + \fl)\ ( bo + \fD \ 

a\ = «i - + aia 2 ji. 


b\b 2 + D + (b\ + b 2 )^/~D / b\ + b 2 \ bo + ~J~D 




+ a\a 2 y. 


( 3 ) Show that 


O1O2 = 7 jCi\a 2 + Z 


bo + \f~D 


and N(aia 2 ) = ci\a 2 . 

bl — D 

( 4 ) If we put Co = — , then this should be an integer. Show that 

4fll£Z2 


c 0 = (ciz + b\x){c 2 z + b 2 y) + ( a\x + a 2 y)(c 2 x + c\y) 


= aft — y. 

(Note, e.g. that b\ — b\ = {D + 4 c/iCi) — (D + 4 a 2 c 2 ) = 4 (oiCi — a 2 c 2 ).) 

( 5 ) For variables x\, x 2 , y 1, y 2 , put 

X = (xi + yia)(x 2 + y 2 / 3 ) - c 0 yiy 2 = x x x 2 + ocx 2 y\ + fix\y 2 + yyiy 2 , 
b 1 T b 2 

Y = a x x\y 2 + C/2X2V1 H — yiy 2 . 

Then show that 

(a \x\ + b\x\y\ + c\y\)(a 2 x\ + b 2 x 2 y 2 + c 2 y \ ) = a\a 2 X 2 + b 0 XY + c 0 Y 2 . 


Exercise 10.28 (General case). Fix a discriminant D = f 2 DK and take two 
proper primitive D f ideals ai and 02 as in the last exercise. This time, we assume 

that 

( b\+ b 2 \ 

gcd I ai,a 2 , — - — I = /, 

where I might be greater than 1 . Take integers x, y, z such that 

, , ^1+^2 , 

xa 1 + ya 2 + z — - — = /• 
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We write ci\ = la\,a 2 = la 2 and 1,1 + b2 = IB. 

(1) Put 

bo = xci\b 2 + ya 2 b\ + z- 


\b 2 + D 


Show directly that bo is an integer and divisible by I . 
(2) Put 

b\ - b 2 

a = x — ; 1- ciz, 


= y- 


2 

b 2 - b\ 


+ c 2 z, 


y = (xc 2 + yc i) 
Show the following equalities. 


b\ + y/l)\ , ( bo + l ~J15 \ , , 

a 2 | ^ = a 2 | + la x a 2 u, 


a i 


, 2 ) 'V 2 > 

+ \[d\ f ( bo + l \[T) ^ 


= a, 


+ lu^Cl 2$ , 


% + Vd\ / b 1 ± jD\ = B / bo + lVD \ + laW ^ 


(3) We put m = gcd(l,a , /3, y). Show that m = 1 by the following steps. 

(i) 

b\~b 2 , , 

= a, a — a-,p g mZ. 


Hence c\z € mZ. 

(ii) 

, , , b 2 b\ 

B(c\z) — a 2 y = c id — Bx. 

Hence ci G mZ. 

(iii) We have bl + bl = IB & mZ and hence b\ G mZ. Also we have 
a\ = la\ G mZ. 

(iv) m = 1 since gcd(a\, b\, ci) = 1. 
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(4) Show that 


aid 2 = Z la\a' 2 + Z 


bo + / \f~D 


— / | Z “l” Z 


&o/ 1 + 


where 7>o / 1 is an integer. 

(5) Show that 

Auction) = N{ai)N(a 2 ). 

(6) Find integers a 2 , b 2 , c 2 , X and Y similarly as in the last exercise such that 

(a\x\ + bix\y 2 + c\y\)(a 2 x\ + b 2 x 2 y 2 + c 2 y 2 ) = + b 2 XY + c 3 F 2 . 


(Hint: Write down the product of general elements of ct] and 02 by the basis 
of aid 2 and take the norms. For example, put a 2 = a[a' 2 , b 2 = bo/ 1, c 3 = 
b 2 — D 

] , , , and X = lx\x 2 + ux 2 y\ + fixiy 2 + yviy 2 , Y = a\x 2 y\ + a\x\y 2 + 
\a\a 1 

By\y 2 . ) 

Exercise 10.29. (1) We fix D = j 2 D k- Show that for a natural number in and a 
proper O f ideal a, there exists a proper D / ideal b which is equivalent to a and 
whose norm iV(b) is prime to in. 

(2) For two proper D/ ideals cq, a 2 , show that A^aicq) = N(ai)N{a 2 ) by using 
Exercise 10.27 but not using Exercise 10.28. 

Exercise 10.30. (1) For K = Q(V5), let Of be the order of K of conductor /. 
By using Proposition 10.18, count the number of proper O f ideals of norm 4 
and 16 for / = 1, 2, 3. Also, give them all explicitly. 

(2) For K = Q(\/TT) and the order £> 3 of K of conductor 3, give all the proper 
ideals of norm 9 explicitly. 

Exercise 10.31. For K = Q(V7), we take the order D 3 of conductor 3. Put L = 
3Z + V7Z. Show that L is a proper D 3 lattice, though it is not an integral ideal of 
D 3 (i.e. not contained in D 3 ). 

Exercise 10.32. For K = Q(V82), let O max = Z + Z >/82 be the maximal order 
and 0 2 = Z + 2\/82Z the order of conductor 2. 

( 1 ) Show that a = 2Z + V82Z is an ideal of O max . 

(2) Show that a is not prime to 2 and that 0 fl D 2 is not a proper ideal of D 2 . 

Exercise 10.33. Assume that q is a proper D / ideal. We write a by the standard 
basis as a = l(Za + Z (d + fco)) with N(d + fco) = ac. We assume that a + 
/ £>max = £) f . Prove that a + f 0 f = D f by the following steps. 

(1) Two numbers l and / are coprime. 

(Hint: If there exists a prime p such that p\I and p\f, then a + f 0 max C 
P0max > which is a contradiction.) 
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(2) Two numbers a and / are coprime. 

(Hint: We have ac = N(d + fco) = d 2 + dfTr(u>) + f 2 N(co). If there is a 
prime p such that p\a and p\f , then p\d , and o C pO max , which is again a 
contradiction.) 

(3) We have a + /£>/ = O f . 

(Hint: By (1 ) and (2), two numbers al and / are coprime and we have x, y e Z 
such that xal + yf = 1.) 



Chapter 11 

p - adic Measure and Rummer’s Congruence 


In modem number theory, the p - adic method or /j-adic way of thinking plays an 
important role. As an example, there are objects called p - adic L-functions which 
correspond to the Dirichlet L-functions, and in fact the natural setup to understand 
the Kummer congruence described in Sect. 3.2 is in the context of the p - adic 
L-functions. To be precise, a modified version (by a suitable “Euler factor”) of 
Kummer’s congruence guarantees the existence of the />-adic L-function. 

To discuss this aspect fully is beyond the scope of this book, but in this chapter we 
explain the p - adic integral expression of the Bernoulli number and prove Kummer’s 
congruence using it. Interested readers are advised to read books such as Iwasawa 
[51], Washington [100], Lang [66], 

We assume the basics of p - adic numbers. For this we refer readers to Serre [83, 
Ch. 1] or Gouvea [37], The results in this chapter are not used in other chapters. 


11.1 Measure on the Ring of p - adic Integers and the Ring 
of Formal Power Series 

In this section we review the general correspondence between measures on the ring 
of p- adic integers Z, p and the ring of formal power series. We use this setup in the 
next section to define the Bernoulli measure on Z p and to express Bernoulli numbers 
as integrals. This expression turns out to be very useful in proving Kummer’s 
congruence relation. 

Let Q p be the algebraic closure of the field Q p of /?-adic numbers. The p- adic 
absolute value | | of Q p (normalized by \p\ = 1/ p) is extended uniquely to Q p . We 
use the same notation | | for this extension. Then Q p is not complete with respect 
to this absolute value, and the completion is denoted by C p . The absolute value | [ 
also extends naturally to C p . Let O p be the ring of integers of C p : 

O p = {x e C p | \x\ < 1}. 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_ll, © Springer Japan 2014 
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Remark 11.1. Like the complex number field C, the field C p is complete and 
algebraically closed. To do analysis in the p - adic setting, we need this big field. 

First we review the general theory of measures on Z p . 

Denote the Z-module Z/ p"Z by X„ and the canonical map from X„+\ to X„ by 
7T„+i, so jr„+i : X n+ \ — » X„ is defined by 

x mod p n+] Z i — > x mod p" Z. 

The system of pairs (X„.n„) gives a projective system and we have the projective 
limit lim X„ : 


lim X n = |(*„) € Yl X n I n n + \(x n + \) = Xn 


n> 1 


The ring of p - adic integers Zp is identified with this projective limit lim X„ . 

Definition 11.2 (Measure on Z p ). A set of functions // = J/i„ jjjL, is called an 
Op -valued measure on Z p if the following two conditions are satisfied: 

(i) Each ii n is an Op-valued function on X n , /i„ : X n — > O p . 

(ii) For any n e N and x e X„ , the distribution property 

iin(x) = Mn+i (y) 

ySX„_|_i 

n n +i(y)=x 


holds. 

The set of O p -valued measures on Zp is denoted by A4(Z p , O p ) . This has an 
Op-module structure. Further, the norm of /i = J //„ j e Af (Zp, O p ) is defined as 

IN = SU P \Hn(x)\. 

n€N, A'SX„ 

Also, the Op -module of continuous O p -valued functions on Zp is denoted by 
C(Zp, Op), and the norm ||^|| of an element (p e C(Z p , O p ) is defined by 

IMI = su p I^WI- 

ASZp 

For ( p € C(Zp, Op) and p. = {pi,,} € A4(Z p , O p ) , the integral on Z p is defined by 

r p "- ] 

/ (p(x)dii(x) = lim V' <p(r)fi n (r). 

1 7 n — >oo z — ' 

JL P r = 0 
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(We use the abbreviated notation for ji„ (r mod p" ). A similar abbreviation 

will be used in the following.) The convergence of the limit on the right-hand side 
is guaranteed by the following estimate: when n < m, we have 

p"- t P "'-\ 

| ^ <p(r)/x„(r) - Y (p(l)n m (l) 

r=0 1=0 

P n ~ 1 p m ~ n -l 

^2\<P(r)ii n (r)- Y <P(r + p"q)Hm(r + p n q)) 

r= 0 q= 0 

p n — 1 / p m n — 1 

Y I X! ( < P(r)- < p(r + p"q))p m (r + p"q) 

r = 0 y q = 0 

< max | (p(r)-(p(r + p”q) | ||/r||. 

r, q 

For each natural number k, the binomial polynomial 




O t(t — 1) • • • (t — k + 1) 
= k\ 


in t is a continuous function on Z p . 

To p = {p„} e A4(Z p , O p ) we associate / e 0 p [|W]] in the following manner. 
Set A = 0p[[X]], A„ = ((1 + X) pn — 1 )A and consider the projective system 
{{A/ A n ,m n )} by the natural map m n : A/A„ — > A / A n -\. Define f„(X) e 
A/A n by 

p"~ 1 p "- 1 '• / \ P n ~ l 

fn(X)= Y^(r)(i+xy = K = X c ”- kXK 

r= 0 r = 0 k=0 \ / k=0 

Here we understand that the equalities are mod A„ and put 

p n - 1 

C n ,k = ^2 P n W 

r = 0 

Since we have 

(m„f„)(X) = 77T„ ^ ^ p n (r)( 1 + A)' j 
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( P n '-lp-l 

E E + ^" _1/ )( i + xy\\ + xy "~ 11 

r '= 0 1=0 

p n ~'-\ 

= lln-l(r')(l + xf 

r '= 0 

= fn-l(X), 

the system ( f n ) is an element in the projective limit lint A/A„ . Now we have the 
isomorphism 


A ^ \xmA/A n , A3g i — » (g„) G Inn A/A n , 

where, for g e A, the system (g„ ) is given by g n = g mod A„. Through this 
isomorphism, the above {/„} corresponds to / e A by 

OO 

f(X) = c m X m , 

m = 0 

where 

c m = lim V Hn(r)\ ' ) 

n-*o o z — ' \ m / 

r=0 \ ) 

We therefore have obtained a map from A4(Z p , O p ) to 0p[[W]]. An important 
fact is that this map gives a natural isomorphism between A4(Z p , O p ) and the ring 
of formal power series often referred to as the Iwasawa isomorphism. The 

way to associate a measure to an element in O p [[A]] is described as follows. 

For / = Ern= o c mX m G O p [[X}], define p = {p„} by 

^) = lEr/((- 1) (reX„), (11.1) 

P f p " = i 

the sum running over all p n -ih roots £ of 1. Since |£ — 1| < 1, /(£ — 1) converges. 
For each m > 0, we have 


1 

P* 


E r r (t- 1)‘ 

y=\ 


i E E r ("W'*' 

F £P n = 1 7=0 W 
0<j<m V / 

y =r mod p n 
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So this is contained in O p . In particular, if p" > r > m . then this is zero. When £ is 
a primitive p v - th root of 1 (v > 1), the equality 


holds and hence 


|£ — \ \ v ^p ) = \p\ (<p is the Euler function) 


|(f- l) m | = \p m !^\ 


From this, we conclude that p e divides the quantity 


0<j<m 
j=r mod p n 




for e = m/<p(p ") —n. Therefore, 


/*.(<•>=£ >„(T E r'«-ir) 

m= 0 1 / 

is convergent and the value is in O p . To check the distribution property (ii) of the 
measure, we need to calculate the following value: 

^2 Mh+iO) = ^2 l l n+\(x + p"a) 

y€X n +i, Tc n j t \{y)=x a mod p 


= E ( E r'-^/s-D. 

“ i ya mod p J 


Using the identity 


E 

a mod p 


0 if ^ 1, 

p if tf = 1 


for a p n+1 - th root £ of 1, we have 


E r'-''' = p E r 


a mod p 
tp “ + '= i 


so we have 


X! M»+iOO = MnW 


reX, i+i 

^n + l(v) = -V 
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which is to be proved. If we define the formal power series / e (^[[X]] 
corresponding to this measure defined as before, then the coefficients c' k of X k of 
this series are given by 


Ci. = 


lim 

n—too 


p n - 1 

E 

/•= o 



= lim Y] c m V' 

n — >oo ^ ^ ' 


m = 0 


r=0 



E 

0<y <m 
;=r mod 



We fix A:. To calculate the coefficient of c m in the expression of c' k in the right-hand 
side above, we fix m. We have (J) = 0 for k > r so we may assume that k < r. 
Taking n big enough, we assume that m < p" . Then, if j = r mod p" for some j 
with 0 < j < m, we have j = r since we also have 0 < r < p n — 1 by definition. 
So we may assume that k < r = j < m. So the coefficient of c m is given by 


E 




1 if m = k, 
0 if m ^ k. 


Hence we have cl = Ck ■ So we have / = / and two mappings are inverse with 
each other and we see that the set M{Z, p .O p ) of O p -valued measures and the space 
of formal power series Op[[^f]] are bijective. 

More precisely, we can introduce a product for both spaces and show that these 
are isomorphic as O p algebras, as given in the following theorem whose complete 
proof is omitted (see e.g. Lang [66, Ch.4]). 

For two measures /x, v e Xi(Z p , O p ), we define an C^-valued function (/x* v) n 
on X„ by 


P n - 1 

(p * v) n {x) = ^2 V-n(y)v n {x -y) {x € X„ ). (11.2) 

y=0 


Then /x * v = {(/x * v)„} becomes an element of A4(Z p ,O p ). We call this a 
convolution product of /x and v. The set A4(Z p , O p ) becomes an O p algebra by 
this product /x * v . 

Theorem 11.3 (Iwasawa isomorphism). Between the space M(Z p ,O p ) of Op- 
valued measures and the ring of formal power series C^[[2f]], there is an O p 
algebra isomorphism P : Ai(Z p , O p ) — > O p [[A']] given by 

OO 

M(Z p ,O p ) 3 fi = {ii n }^f(X) = J2 c m X m e O p [[X}]. 

m = 0 
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Here, c m is determined by p.: 


Cm 



dp(x ), 


and conversely p n is determined by f : 


in(x) = - n y r x m- 1 ). 

p n 

tp"= i 


For convenience of the description below, we recall Mahler’s 1 theorem giving 
the necessary and sufficient condition for an O p -valued function on Z p to be 
continuous. 

Theorem 11.4. The function < p : Z p — > O p is continuous if and only if it can be 
written as 

OO 

<p(x) = ^2 a,r 

n = 0 

If this is the case, the coefficients a„ are uniquely determined by (p and given by 

a n = D-D-Ttw. 

k= 0 V / 

We omit the proof (cf. Lang [66, §4.1]). 

If we use Theorem 11.4, we can understand a part of Theorem 11.3 more 
intuitively as follows. Fix xo e Z. Denote by (p the characteristic polynomial of 
Xo + p"Z p . Then by the definition of the p - adic measure, we see easily that 


a u £ (D p 


L 


(p(x)dp(x) 


o). 


So if we replace (p(x ) by the expansion (p(x) = a m (*) in Theorem 1 1 .4, we 

have 


OO 

H'ni.X o) — ^ ^ Cm 

m= 0 


m = 0 k = 0 



<p(k). 


'Kurt Mahler (bom on July 26, 1903 in Krefeld, Prussian Rhineland — died on February 25, 1988 
in Canberra, Australia). 
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Now, for any £ with t, p " = 1, we have 





Since (p(k) = 1 if k = Xo mod p n and <p(k) = 0 otherwise, we have 


1 


£ r xo (-i r~ k <p(k). 

n n — i 



So we get the expression of p(x) by / in Theorem 1 1.3. 

We describe here several useful properties of the correspondence P in Theo- 
rem 11.3 between measures and formal power series. Let the maximal ideal of O p be 


V = {ze O p \ \z\ < 1}. 


For zeP, define the function (1 + z) x in x by 



By Mahler’s theorem, (1 + z) x is a continuous function of x e L p . When x is a 
non-negative integer, this definition of (1 + z) x coincides with the usual binomial 
expansion. We have the relation 


(1 + z) x ( 1 + z) x ' = (1 + z) x+x ' o, jc' e Z p ). 


(11.3) 


This is obvious for x, x' G N, and the general case for x, x' e Z p follows from the 
fact that the set N of natural numbers is dense in Z p . 

In the following, we list several properties of measures and corresponding power 
series, which will be used later. 

Property (1). Let z € V. If p corresponds to / (i.e. Pp = /), then 



In particular, by putting z = 0, 
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oo 

Proof. Writing f(X) = c n X" , we have by Theorem 1 1.3 

n = o 


f 0 + z) x dji(x) = 

z , 



= £ c n f = f(z). 

n = 0 


□ 

We call the map A from C(Z p .O p ) to O p a bounded linear functional on 
C(Z p , O p ) if the following conditions (i), (ii) are satisfied: 

(i) For any (p , (p ' e C(Z p , O p ) and any a, b e O p , 

X(aip + b(p') = aX((p) + bX((p'). 

(ii) There exists a positive constant M > 0 such that for any <p> e C(Z p , O p ). 


\X{<p)\<M\\cp\\. 

The norm of A is defined by 

l|A|| 

Let A be a bounded linear functional on C(Z p . O p ). For x e X n = Z/ p"Z, write 
the characteristic function of x + p"Z p as <p x ,n- If we put 


sup 

ip€C(Zp.Op) 

ip^O 






then /x = {Hn} is an Op -valued measure on Z p (i.e. /x € A4(Z p , O p )). Conversely, 
given /x = {/x„} e M(Z P , O p ), if we put 



<p(x) dji{x), 
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then A is a bounded linear functional on C(Z p ,O p ). This correspondence between 
A and p is easily seen to be one to one. 

Moreover, for h e C( Z p , O p ) and p e M(Z p . O p ) , the map 

(p i — > / (p(x)h(x) dp{x), {cp € C(Z P , Op)) 

J z p 

is a bounded linear functional on C(Z pi O p ). Let hp be the corresponding measure. 
It is an interesting problem to compute the formal power series corresponding to the 
measure hp when p corresponds to / = Pp e 0p[[Af]]. Properties (2) and (3) 
below give examples of this correspondence. 

For / e 0„[[X]], put 

(U/)(X) = /(X) ~-T /(£(! + *) - !)• (11.4) 


Since 


- E m + X)-\)) 1 eZ p [X] 

y i 

for non-negative integers /, we have U / e 0 P [[X]]. 

Property (2). Let / e O p [[X]] and pp be the corresponding measure. Also, let i jr 
be the characteristic function of Z* . Then the formal power series corresponding 
to the measure ^rp/ is U /, i.e., \/spf = pu /. More precisely, we have for any 
<p e C(Z p , O p ) 


/ (p{x)f{x)dp f (x) = / <p(x) dpufix). 

J z„ J z„ 


This can also be written as 


/ (p(x)dpf(x) = / (p(x)dp V f(x). 
J z* Jz„ 


Proof. Write the power series corresponding to the measure \[rpf as g. When z & V, 
by Property ( 1 ) we have 


g(z)=[ (1 + z) x f(x)dpf(x). 

Jz p 

Let £ be a /tth root of 1. Regarding i// also as a function on Z/ /;Z via i//(c; mod 
/>) = xf(a + pZ p ), and putting 
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fi.0 = “ E 

n ‘ J 


fl€Z / pZ 

(Fourier transform on Z/ p Z) we have 

i Ha) = E " 

t;p=\ 

by a simple calculation (inverse Fourier transform). Since 


HO = A 


if £ 7^ 1 - 
if? = 1, 


by the definition of t jr, we obtain 

g(z) = f (1 + z) x f(x)dfi f (x) 

Jz p 

= ( (i+z)*E nm xd nf(.x) 

jzp 


i p = i 

(1 + z) x Odfi f (x ) 


?p=i 


= E to f (l + (?(1 + 2) - 1))* 

fr=l *' z p 

= E Horn i + z) - 1) = /(z) - - E + z) - !)• 


^=i 7 ^=i 

This shows g = U/. (Here we define the power for i G by 


r = ( i + c-ir = E rtf- 1 )"- 


If we choose a e Z so that x — a G /tZ^, we have = 0 ■) □ 

Define the differential operator D on the ring of formal power series 0^[[X]] by 


where Dy = — • 
dX 


D = (1 + X)D x , 



194 


1 1 p-adic Measure and Kumraer’s Congruence 


Property (3). For / e 0p[[X]], the power series corresponding to the 
measure xfij is Df . Hence the power series corresponding to the measure 
x k ii f ( k natural number) is D k f and the equalities 


f x k djifix) = f dn D k f (x) = (D k f)( 0) 
J Z„ J z„ 


hold. 

Proof. It is enough to show this when k = 1. Let g e (^[[A']] be the power series 
corresponding to the measure x/i / . By Property (1), we have for z G V 

g(z) = f x(l +z) x dp. f (x). 

Jz p 

Put f(X) = E„°°=o a n X\ g(X) = E,r= o bnX*. Using 



and Theorem 1 1.3, we have 


= (« + 1) 




n = iz («) dflg ^ = jy 


= (« + 1) 

= (n + l)a„+i + na„ 
On the other hand, Df is computed as 


(Df)(X) = ((l+X)D x f)(X) 

= (1 + X)(a\ + 2a 2 X + ■ • ■ + na n X” 1 + ■ ■ ■ ) 

oo 

= ^2 {{n + l)a„+i + na„)X". 

n=0 


This gives g = Df . □ 

In general, for a power series f{X), we define a new power series f*(Z) in Z 
by setting X = e z — 1: 


/* (Z) = f{e z - 1). 


(11.5) 
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For example, when 


we have 


f(x) = d + xr 


oo 


E 



x\ 


f*(Z ) = e' 


oo 


E 


a" Z" 
nl 


Note the identity 

(D k z f*m = (D k /)(0) 


(11.6) 


since 


D z f*(Z) = (1 + X)D x f(X) = Df(X). 

The next property is the basis of the fact that the isomorphism P in Theorem 11.3 
is an O p algebra isomorphism. 

Property (4). Let the measures //, v correspond respectively to the power series 
f g e 0p[[X]] (i.e., p, = ptf, v = n g ). Then the power series corresponding to 
the convolution // * v is fg: 


Pf * Pg — Pfg- 

Proof By Eq. ( 1 1 . 1), we have 

Pn(r)= - n £ r/tf- 1). 


i “ n = 1 


Vn{k-r)=- n r k+r g(^~ 1). 


t p =1 

Substituting this into the right-hand side of (11.2), we obtain 

p n - 1 

— V r r m- 1: 

p 


in*vu k )='£ E E rv(?-i)E E r* +r *G- d 

r=0 ” £P n = 1 ” ^P n = | 

= i E E - !)r* • E £ (*/o r 


f ? 
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— ! l fg, n (k)- 


Here £ and f run through all p n - th roots of 1. From this, Property (4) follows. □ 

11.2 Bernoulli Measure 

We define a specific measure called the Bernoulli measure. Recall that the first 
Bernoulli polynomial is by definition equal to 



In the following, p denotes an odd prime. For each natural number n and x e X n = 
Z/p n Z, set 



where in the right-hand side, we regard x as an integer representing x mod p" , and 
for w e R, {w} is the real number satisfying 0 < {vv} < 1 and vv — { vv } e Z (the 
fractional part of vv). Then E = {E n } is a measure on Z p but is not Op-valued. We 
modify this as follows in order to have an Op-valued measure. Take an invertible 
element c in Z p (i.e. c eZj), and for x e X„ = Z / p" Z, let 


E c ,„(x) = E n (x) - cE„(c l x). 


We understand c 1 x as an element in X„ = Z/ p"Z. It is easy to see that E c = 
{E c „} is an Op-valued measure. We call this the Bernoulli measure. 

Proposition 11.5. (1) The formal power series corresponding to the Bernoulli 
measure E c is given by 


fc(X) = 


1 


c 


x (i + xy - i 


(2) Let k be a natural number. For c eZ* with c k 1, we have 



In particular, if p — 1 \ k, then B^j k e Z (p ). 
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Proof ( 1) Since c e Z*,we see f c e Z^X]], the first two terms of f c (X ) being 

c — 1 1 — 

f c (X) = + X + • ■ • . 

7 V 2 12 


Let fi = {//„} be the measure on Z p corresponding to f by Theorem 1 1.3. For 
r e X„ = Z//?"Z we have 


Hn(r)= 4 E r r /c«-l) 


=1 


-3r/ c ( °)+^T E £ r (j3T - ^Ti) • 


1 

E 


Now we use Lemma 8.5 on p. 110. For = 1, £ / 1 and / = p'\ the lemma 
gives 


1 


1 /_1 

yEif 

7 7 = 1 


cj 


since (c, p) = 1. By this, if we choose / so that cl = k mod p n , 0 < l < p n , 
we obtain 


1 

7 


E 

^”=1,^1 



— E 

f 2 L^, 

7 i;P n = 1 


/-I 


r* E - 


c(/-l) 

2 / 


d c(f - 1) 

/ 2 / 


= c 


c | c , c 
”p"J 2 + 2f 


and by substituting this into the formula for /x„ (r) above and noting that f c (0) = 
(c — l)/2, we have 


P„(r) 



1 1 

— + T T — C 
2 2 / 






By the definition of the Bernoulli measure, we conclude //„(/') = E c n (r ), i.e., 
/x = E r and the power series corresponding to E c is f c . 

The proof of (2) goes as follows. By Property (3) and Eq. (1 1.6) we have 
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x*- 1 dE c = c D k - i f c xo ) = (D k 7 r i f ( *m. 


Here by definition (11 .5), we have 


f*(Z) = f c {e z — 1) = 



c 


e cZ — 1 


= £ 


(i 


Z ' ,_1 

77! 


so we have 


and thus 


This gives (2). 





x k ~ l dE c = (l-c k )(-l) k ^. 

k 


□ 


11.3 Kummer’s Congruence Revisited 

The “right” formulation of Kummer’s congruence is the following. 

Theorem 11.6. Suppose p is an odd prime. 

(1) Assume that m is a positive even integer such that p — 1 \ m. Then B m /m e 

Z( P )- 

(2) Let a be a positive integer, and m and n positive even integers satisfying m = 
n mod (p — 1 )p a ~ 1 and m ^ 0 mod (p — 1). Then we have 

(1 - p m ~ 1 )— = (1 - p n ~ 1 )— mod p a . 
m n 

To prove this, we need the following integral expression of the Bernoulli number, 
a refined version of Proposition 1 1.5 (2). 

Proposition 11.7. Let k be a positive even integer and take ceZ*. Then we have 

(l-c k )(l-p k - l )^= f x k ~ l dE c . 
k J zy 
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Proof. The power series that corresponds to the Bernoulli measure E c is f c in 
Proposition 1 1.5. As in (1 1.4), define from f c a new power series g by 

g(X) = u. f c (X) = fc(X) - - V / c (£(l + X) - 1). 

We have g e O p [[A]] and so we let fi = /i g be the measure on O p obtained from g. 
By Property (2) on p. 192 we have 

f x k ~ ] dE c = f x k ~ x dfi. 

J z? •'z , 

Further, using Property (3) on p. 194 and (1 1.6) one sees 


L 


x k ~ x dp, = (D*~ l g)(0) = {D k ~ l g*){ 0). 


\k - 1 


k— 1 \ 


We compute the value (D k z 1 g*)(0) . First, 

1 


g*(Z) = 


e z — 1 e 


cZ 


1 p ^ “ 1 ^ c e cZ — 1 ) 


Here, since 


1 v 1 - 1 

~p tx- 1 “ XP-l' 


we get 


g*(Z) 


e z — 1 e cZ — 1 \e pZ — 1 e cpZ — 1 

oo R oo „ 

= £ (1 - c k )(-\) k ^z k - x - £ (1 - c i )(-i) /c -^(pZ) A_1 


k=0 


k=0 


k\ 


oo o 

= £(l-c t )(l-/- 1 )(-l)*g-Z*- 1 . 


fc= 1 

Hence if /r is even we have 


(D k z -'g*m = (l-C k )(l-p k - 1 ) 1 y, 


and the proposition is established. 
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Proof of Theorem 11.6. The first assertion is already given in Theorem 3.2, but we 
give here an alternative proof for that too. Since we assumed m f 0 mod p — 1, we 
can take c e Z such that (c, p) = 1 and c m f 1 mod p. For instance one may take 
a primitive root mod p. From the proposition above, we have 

(l - c n )(l - p 11 - 1 )— = f x n ~ l dE c 

n hf 

and 

{\-c m ){\-p m - l )—= [ x m ~ l dE c . 
m J Tf 

The assumption m = n mod (p — 1 )p a ~ l gives c n ~' n = 1 mod p a , and since we 
assumed (1 —c m , p) = 1, we have also (1 —c n ,p) = 1 . Since E c is an O p measure, 
the above integral values are in O p and we see that B„/n and B m /m e Since 
x m ~ l = x n ~ l mod //' if A' e 7/ p , and since E c is an C^-valued measure, we have 

{\-c n )((\-p n ~ 1 )— -(\-p m - 1 )— ) e P a O p . 

\ n m ) 

The left-hand side being contained in Z^, we conclude 

d-p'- 1 )— -(I-/*"- 1 )— € p a Z p . 
n m 

This proves the theorem. □ 

Theorem 3.2 is a corollary of Theorem 11.6. Indeed, if a < m < n, then by 
Theorem 1 1 .6, we have 

(1 — p" 1-1 )— — (1 p n ~ 1 ) — 

m n 

= (1 - p m ~ x ) + — {jp n ~ m - 1 )p"-> 

\ m n ) n 

= 0 mod p a . 

Since p — 1 \ n, we have B„/n e Z( p} by Theorem 1 1.6. Since a < m — 1, we have 
p" 1 ^ 1 B„/n € p a Z,( p ). Hence we have 

— — — = 0 mod p a . 
m n 
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Exercise 11.8. Give an example of an odd prime p and integers 2 < a = in < n 
such that the congruence in Theorem 3.2 does not hold. Check that for the same 
choice of a, n,m and p, the congruence of Theorem 11.6 surely holds. 

Hint: For example, put p = 5, a = m = 2 and n = 22 and use the following 
values: 


Exercise 11.9. Show that the Bernoulli number B„ is given by the limit ( /;-adic 
limit in Q p ) 



(For a function / : Z, p — » Q ; , with a suitable condition, the limit 





is sometimes referred to as the Volkenborn integral of / over Z p . See [94,95] for 
details.) 


Chapter 12 

Hurwitz Numbers 


12.1 Hurwitz Numbers 


In this section, we briefly introduce Hurwitz’s generalization of Bernoulli numbers, 
known as the Hurwitz numbers. 

As we have seen in Proposition 1.17, Bernoulli numbers appear as the coeffi- 
cients of expansions of trigonometric functions. Hurwitz replaced the trigonometric 
function by a “lemniscate function”, which, in modern terms, is an elliptic function 
having complex multiplication by the ring of Gaussian integers Z[V— 1], and 
considered its expansion. It was Gauss who first studied the lemniscate function 
or more general elliptic functions, but most of his work on them did not appear 
during his lifetime. In his Disquisitiones [35], Gauss mentioned at the beginning of 
Chap. 7 that his theory of cyclotomy developed there could also be applied to the 
division of the lemniscate. Inspired by this comment, Abel 1 and Jacobi started their 
investigation on elliptic functions and laid the foundations of the theory. 

Let us define Hurwitz numbers. For this, put 

dx 

m = 2\ - (= 2.622057...). 

Jo 

This is one-half of the arc-length of the lemniscate defined by 

r 1 = cos(2 0), 

and is analogous to the similar quantity for the case of the circle: 



1 Niels Henrik Abel (bom on August 5 , 1 802 in Frindoe, Norway — died on April 6. 1 829 in Froland, 
Norway). 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions , Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_12, © Springer Japan 2014 
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Let p(z) be the Weierstrass p-function with periods m and m V— T defined by 


p{z) = \ + 




A <E Zvj - j-Zm V — 1 
At^O 


A ) 2 



( 12 . 1 ) 


The function p(z) is a doubly periodic meromorphic function on the complex 
plane with periods m and , having double poles at the lattice points 

Zm + Z m V— I and holomorphic elsewhere (for the basic properties of the elliptic 
function, see [104]). Owing to the fact that the lattice Zm+Zm V— 1 is preserved by 
multiplication by V—l, the function z p(Az) for any A e Z[V— I ] is expressible 
as a rational function of p(z) and p'(z). By virtue of this property, p(z) is said to 
have complex multiplication by Z[V— I]- In particular, it is easily seen from the 
definition that p(z) is an even function and 

p(V^lz) = -p(z). 


Also, p(z) satisfies the differential equations 

p'(z) 2 = 4 p(z) 3 - 4 p(z) 


and 


p"(z) = 6p(z) 2 - 2. 


Write the Laurent expansion of p(z) at z = 0 as 


P(z) = 


J_ y, 2" H n Z n ~ 2 
z 2 ^ n (n — 2)! 

n= 2 


(12.2) 


and use this to define the Hurwitz numbers H n . From the property p(V—iz) = 
—p(z), we see that //„ = 0 unless n is a multiple of 4. The analogous expansion for 
the Bernoulli numbers is 


1 

sin 2 Or) 


1 ~ (-I)"-' 2” B„ x n ~ 2 

x 2 t— 1 n (n — 2)! ’ 

n = 2 


which can be obtained from the expansion of cot given in Proposition 1.17 by 
differentiation, since cot'(v) = — -4 — . In a similar manner to the proof of 
Proposition 1.15, namely by comparing the coefficients of 


p"(z) = 6 p(z) 2 - 2, 


(12.3) 
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Table 12.1 Hurwitz numbers 


n 

4 

8 

12 

16 

20 

24 

28 

32 


1 

3 

567 

43659 

392931 

1724574159 

2498907956391 

1671769422825579 

10 

10 

130 

170 

10 

130 

290 

170 


we have 


H a = 


1 

To’ 


and the recursion 


(2 ft - 3)(4 n - 1)(4 n + 1 )H 4n = 3 ^(4/ - 1)(4 n - 4/ - 1) ^ . j H 4i /7 4( „-<) 

for n > 2. In particular, we see from this that /Y 4 „ is a positive rational number. 
Several values are in Table 12.1. 

The analogy between Hurwitz and Bernoulli numbers is more apparent when we 
look at the special values of zeta functions. Namely, first we rewrite the formula 
(4.4) as 



r€ Z 


r# 0 


(- 1)"" 1 


(2 jz) ln 

( 2 «)! 




(» > !)■ 


Note the sum on the left runs over all non-zero integers. If we replace the rational 
integers here by Gaussian integers, we have the formula 

y- J__ (2m) 4 " 

^ A 4 " (4n)! 4 "' 

Xez+zV^T 

To deduce this, expand the right-hand side of the definition (12.1) of p(z) by using 
= I? + + • ' • an< 3 compare with (12.2). 

Hurwitz proved a Clausen-von Staudt type theorem for H„ . To state this, we 
need the following classical theorem of Fermat and Euler. 

Theorem 12.1. Any prime number congruent to 1 modulo 4 can be written as a 
sum of two squares essentially in the unique way. 

Various proofs of this theorem are known since Euler. See [50] or [77] for some 
of the standard proofs. For Zagier’s “One-sentence proof”, which simplifies a proof 
by Heath-Brown, see [108], 

Suppose now that p is a prime number congruent to 1 modulo 4 and write 

p = a 2 + b 2 . 
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We may assume a is odd and b is even. We change the sign of a if necessary in 
order to have 


a = b + 1 (mod 4). 

(Note that the right-hand side is independent of the choice of the sign of b since b 
is even.) These conditions uniquely determine a and so we denote it by a p to make 
the dependence on p explicit. For instance, we have 

5 = (- 1) 2 + 2 2 , 13 = 3 2 + 2 2 , 17 = l 2 + 4 2 , 


SO 


Cl 5 = — 1, $13 = 3, $17 = 1. 


Hurwitz’s theorem is then stated as: 
Theorem 12.2. For each n > 1, we have 


H An — 



E 

p = 1 mod 4 
p — 1|4 n 


An 

(2 a P )e-' 

P 


+ G„ 


with an integer G„. In the sum , p runs over prime numbers congruent to 1 modulo 
4 such that p — 1 divides An. 

As examples of the theorem, we have 


H 4 = 


= 


H 12 = 


H 16 = 
H20 = 
7-/24 = 


11 (- 2 ) 

10 2 + 5 ’ 

1 _ 1 (~ 2 ) 2 , 

10 2 + 5 

567 1 (— 2) 3 6 „ 

— ~ H - 1- 5, 

130 2 5 13 

43659 1 (— 2) 4 2 

= - + - — - + — + 253, 

170 2 5 17 

392931 1 (— 2) 5 

— = 2 + L T L+392 "' 

1724574159 = 1 + + 5! + 13265939. 

130 2 5 13 


Hurwitz also proved that every G„ for n > 1 is odd and determined its class 
modulo 8 . The proof [44] of Theorem 12.2 is not as elementary as the original 
Clausen-von Staudt theorem. It uses the complex multiplication of the p-function 
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as well as the notion of Hurwitz integral series (cf. Chap. 7). See [27,58] for a 
modern interpretation of the theorem and Kummer-type congruence for H n . 


12.2 A Short Biography of Hurwitz 

Adolf Hurwitz was born on March 26, 1859 in Hildesheim, Germany. 2 3 His teacher 
at the Gymnasium was Schubert, ' who is famous for his enumerative geometry 
(“Schubert calculus”). Impressed by Hurwitz’s genius, Schubert came every Sunday 
as a house tutor and persuaded Hurwitz’s father, who was not at all affluent, to 
send his son to the university to study mathematics. Already during his time at 
the Gymnasium, while he was still 17, Hurwitz wrote a joint paper with Schubert 
on enumerative geometry. In the spring of 1877, he entered Munich Technical 
University, where Klein 4 was. From the fall of the same year until the spring 
of 1879, he attended the lectures of Kummer, Weierstrass, and Kronecker at 
Berlin University. Returning once again to Munich, and then moving to Leipzig 
together with Klein, he obtained his doctoral degree there with research on modular 
functions. He became a “Privatdozent” at Gottingen University in 1882 and then, on 
the recommendation of Lindemann, 5 6 a professor at Konigsberg University in 1884. 
Hilbert 11 and Minkowski 7 were among the students there. The exchanges among the 
three of them in those early days and the influence which this had on Hilbert are 
vividly described in [78]. In 1892, he moved to the ETH in Zurich as a successor 
of Frobenius, 8 * a position he was to hold until his death. At the same time he was 
offered a professorship at Gottingen as the successor of Schwarz, who moved to 
Berlin, but he had already accepted the position at Zurich (the director at Zurich at 
that time, Bleuler, went especially to Konigsberg to make the contract). 

Hurwitz’s health was not good. He got typhoid fever twice and suffered from 
frequent migraines. And in 1905, one of his kidneys had to be removed. In the midst 
of all this, he continued to do research and published in a large number of areas. 


2 The description in this section is based on Freudenthal [34], Hilbert [41], and Polya [76]. 

3 Hermann Casar Hannibal Schubert (bom on May 22, 1848 in Potsdam, Germany — died on July 
20, 191 1 in Hamburg, Germany). 

4 Felix Christian Klein (bom on April 25, 1849 in Diisseldorf, Pmssia (now Germany) — died on 
June 22, 1925 in Gottingen, Germany). 

5 Carl Louis Ferdinand von Lindemann (bom on April 12, 1852 in Hannover, Hanover (now 
Germany) — died on March 6, 1939 in Munich, Germany) who proved in 1882 that ji is a 
transcendental number. 

6 David Hilbert (bom on January 23, 1862 in Konigsberg, Prussia (now Kaliningrad, Russia) — died 
on February 14, 1943 in Gottingen, Germany). 

7 Hemiann Minkowski (bom on June 22, 1864 in Alexotas, Russian Empire (now Kaunas, 
Lithuania) — died on January 12, 1909 in Gottingen, Germany). 

8 Ferdinand Georg Frobenius (born on October 26, 1849 in Berlin-Charlottenburg, Pmssia (now 

Germany) — died on August 3, 1917 in Berlin, Germany). 
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His articles have been collected and comprise two volumes. As well as the Hurwitz 
zeta functions and Hurwitz numbers treated in this book, there are many other topics 
which carry his name, such as the Hurwitz class number relations, Hurwitz’s genus 
formula, the Hurwitz estimates for the size of automorphism groups of Riemann 
surfaces and his investigations on the quaternions and other algebras. And according 
to Hilbert, continued fractions were also a favorite topic of Hurwitz. (He wrote 
approximately five articles on the subject.) 

Hurwitz seems to have been an exceptionally modest and courteous man. 
According to Polya, 9 who was the main editor of his collected works, he never 
failed to tip his hat in greeting even to service people whom he met on the street. 
Moreover, he was so skilled at the piano that in his youth he hesitated between a 
career as a mathematician or as a pianist. He died on the 18th of November, 1919, 
at the age of 60. 

Exercise 12.3. Deduce the recursion of H„ given in the text from (12.3). 

Exercise 12.4. Compute the residues 2H n mod 1 6 for first several n , and find the 
pattern. (For a proof, see [44].) 


9 George Polya (born on December 13, 1887 in Budapest, Hungary — died on September 7, 1985 in 
Palo Alto, USA). 


Chapter 13 

The Barnes Multiple Zeta Function 


In this chapter, we introduce Barnes’ multiple zeta function, which is a natural 
generalization of the Hurwitz zeta function, give an analytic continuation, and then 
express their special values at negative integers by using Bernoulli polynomials. 
Furthermore, for double zeta functions, we take up the problem of finding a 
functional equation. This problem has several interesting points. For example, a 
kind of Lerch 1 type zeta function appears in the functional equation, and also the 
proof has the flavor of the theory of automorphic forms. 

The motivation to take up Barnes’ multiple zeta function here is that this is 
deeply related with the very important and interesting conjecture about constructing 
class fields over real quadratic fields, or totally real algebraic number fields (Stark- 
Shintani 2 conjecture, see [86, 87, 89] for example). Although we do not treat it in 
this book, the value at s = 0 of the first-order derivative of Barnes’ multiple zeta- 
function is expressed by special values of the multiple gamma function. Shintani 
saw that for the construction of class fields of real quadratic fields, special values 
of the double gamma function play an important role, and he proposed a detailed 
conjecture that certain special values of the double gamma functions generate the 
class fields, and solved part of his conjecture (cf. [85-87]). For the construction of 
class fields over imaginary quadratic fields, modular forms such as the j -invariant 
play an extremely important role, but in the case of real quadratic fields, any 
viewpoint from modular forms is not considered in the above papers. The Lerch 
type function treated in this chapter was introduced with the intention of giving 
more or less such a viewpoint to the construction problem of class fields over real 
quadratic fields. For details, see [6,7]. 


'Mathias Lerch (bom on February 20, 1860 in Milinov, Bohemia (now Czech Republic) — died on 
August 3, 1922 in Susice, Czechoslovakia (now Czech Republic)). 

2 Takuro Shintani (bom on February 4, 1943 in Kita-kyushu, Japan — died on November 14, 1980 
in Tokyo, Japan). 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_13, © Springer Japan 2014 
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13.1 Special Values of Multiple Zeta Functions 
and Bernoulli Polynomials 


Let all a , w\, W 2 , ... , w r be positive real numbers. We put w = (w i, . . . , w r ) and 
define a zeta function £(s,w,a) by 

oo oo 

£(s,w,a) = E-E ( a + m\w\ + H m r w r ) s . 

m i=0 m r = 0 


This zeta function converges absolutely for Refs) > r. This kind of zeta function 
was systematically studied by Barnes 3 [1 1, 12], 

For a while, we assume Refs) > r. We see that f (,s, w, a) has an integral 
representation: 


£fs, w, a) 



\-m r w r )t ^ 


1 

W) 


1 

W) 


1 

W) 



t s - 1 E 


oo 

bm r w r )t 

m r = 0 




(1 _ e -wi/)...(i _g-w r r) 


dt 


hw r — 

i 

(e wit - 1) • • - (e Wrt - 1) 


dt. 


Since the series ^(s.v/.a) converges absolutely for Re(.y) > r. by virtue of 
Lebesgue’s dominant convergence theorem for the Lebesgue 4 integral, the above 
calculation is justified for the same range of .s. When r = 1, this is essentially the 
Hurwitz zeta function. Namely, for wi > 0, we have 


Z(s,w\,a ) = 



Now, a multiple zeta-function f (s, w, a) as well as the Hurwitz zeta function, can be 
represented by a contour integral. Indeed, taking £ > 0 small enough and denoting 


3 Ernest Barnes (bom on April 1, 1874 in Birmingham, England — died on November 29, 1953 in 
Sussex, England). 

4 Henri Leon Lebesgue (bom on June 28, 1875 in Beauvais (Oise), France — died on July 26, 1941 
in Paris, France). 
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by I(s, oo ) the contour in Sect. 9.2, we have 

\ r „(wH 1 -w r —a)t 

I(s.w,a) = — / t s_1 dt. (13.1) 

r(s)(e 1 2 «" - 1) J l(St00} (e^-1 -i) 

An advantage of this representation is that £( 5 , w, a) is continued analytically to a 
meromorphic function on the whole 5-plane by this, because the contour integral 
on the right-hand side converges absolutely for arbitrary s e C and it defines a 
holomorphic function on the whole 5-plane. 

A point which possibly becomes a pole of £ ( 5 , w, a) is among zeros 


5 = r. r — 1, . . . , 0, —1, —2, . . . 


of e 2nls — 1 which are not in the range of absolute convergence. But r ( 5 ) has poles 
of order 1 at integer points less than or equal to 0, so points 5 = 1 — m (m e N) 
cannot appear as poles and ( ( 5 , w, a) is holomorphic there. 

Let us look for special values of £( 5 , w, a) at integer points 5 = 1 — m (m e N). 
For any integer k > 0, and for any fixed a and w, we define constants C, t(w, a) by 


e a, t r 

n;=i(e w ' ? - 1) 


^2C k (^,a)t k , 

k=0 


Proposition 13.1. We take m e N. If we write a > 0 as 


a = a\w\ + a 2 W 2 + ■ ■ ■ + a r w r (a \ , 02 , ... , a r e R), 


we have 

£(1 — m,w,a) 


= (-iy(m— 1)! Y, FI 


Bm J ( Cl i ) I mi— 1 m r — 1 


m \ H \-m r =m-\-r—\ \j — 1 

m i >0, 0 


nij ! 


w 


■W' 


= (-1 ) r (m- iy.C m+r -i(w,a). 


Proof. The proof is based on the same method as the one used for looking at 
special values of the Hurwitz zeta functions. In the same way as in the proof of 
Proposition 9.3, we have 

1 \ r AwiH h w r -a)t 

lim ) / t~ m dt 

r(s)(e 2n,s - 1) / J I{e) {e K '‘ -!)■■■ (e w r‘ - 1) 


£(1 — m, w, a) 
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1 )! 

2ni 



e (\ -aj)wjt 

e w J t - 1 


dt. 


Now by using the generating function of Bernoulli polynomials (p. 56, (4.2)) and 
Proposition 4.9 (4), the Laurent expansion of the integrand is obtained as follows: 


e (l -aj)wjt 

n = r 

1 i e Wjt _ J 

7=1 


oo oo r / 

e ■■ ■ e n ( Sffl 7 

AW 1=0 AW;- =0 7=1 v 


C 1 — «/) 


(w y r)” 


OTy! 


-) 


OO oo 

= E - E 

AW 1 =0 AW r =0 


(-l)'"7B,„ 7 ( a/ )w” 


^.awi H Ym r —r—m 


The residue at t = 0 of this integrand is given by 


E 

AW 1 H |-AW r =AW+A - — 1 

AA2l>0, ...,m r >0 


n <->)■ 




mj ! 


Using the generating function of Bernoulli polynomials in Proposition 4.9, we have 


E 


^awi (^l) ' * * Bm r (^r) mi 

^ r — w i 

m\\ • • *m r ! 


. . . l-/n r 


=n 


( Wi t)e a ‘ Wit 
e w ‘ ‘ - 1 


(Wi ■■■ W r ) 




n;=i(^ r -i)' 


so we get the final equality of the proposition. 


□ 


13.2 The Double Zeta Functions and Dirichlet Series 

Now we set w = (vvi , i-vt). It will be a very interesting problem to study what kind 
of zeta functions appear if we cut out the contour integral representation (13.1) of 
the Barnes double zeta function £($, w, a), in the same way as we did when we 
obtained the functional equation of the Hurwitz zeta functions (Theorem 9.4). 

In order to answer this problem, we introduce a new Dirichlet series. For each 
real algebraic irrational number 5 a, let us consider the following Dirichlet series: 


5 A real algebraic number which is not a rational number. 
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£0 ,<*) 


E 


coin not 
n s 


Since cot(x) is unbounded, we need to show that this converges when Re(.y) is big 
enough. 

By virtue of Roth’s theorem on Diophantine approximation in transcendental 
number theory, the following things are known. Assume that a is an algebraic 
irrational number. When we take any positive number e, there exists a positive 
constant C ( s ) such that 


> C(s)n~ 2 ~ e (13.2) 

for arbitrary integers m. n (n > 0). Now for a real number x, we use the notation 
((x)) for the distance to the nearest integer from x. We have 0 < {{*)) < 1/2- 
Using this notation, for any integer n, (13.2) is expressed as 

{{not)) > C(s)n~ ] ~ t: . 

When 0 < x < 7i / 2, the inequality sin x > —x holds, so 



| cot7rna;| < 


1 

sin 7t ((not)) 


1 

2 ((««» 


< 


2 C(s) 


1+E 


By this estimate, if we put a = Re(y), we can take 


1 

2C(e) 


E 


n 


l + E 


n a 


as a majorant series of ^ (s , a), and it converges for a > 2 + s. Since we can take 
arbitrary e, £(s, a ) converges for 6 Re(.v) > 2, and moreover it converges uniformly 
on compact sets, so it is a holomorphic function of s in that range. 

We denote by SL 2 (Z) the modular group which appeared before (p. 76). If we 


let an element V 



of SL 2 (Z) act on a real algebraic irrational number a by 


6 In fact, if we use Roth's theorem more efficiently, we can show absolute convergence for Refs) > 
1 (see [6]). 
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T , aa 4- b 

Va = 

coi + a 

then Va is again a real algebraic irrational number. 

Let co and a be positive numbers and consider the following double zeta function: 

OO 

£ 2 (s,co,a) = ^2 (m + nco+a)~ s . 

m, n=0 


In the former notation, we have co, a) = '(, (s, ( 1 , co), a). We put 


/ 0 ) = 


(2 nf 

f(s) sin(jrs/2 ) ' 


The zeta function £ 2 (s,a>,a) converges absolutely for Re(,s) > 2, is holomorphic 
there, and is continued analytically to a meromorphic function on the whole ,v -plane, 
and its poles are s = 1,2 and of order 1 . 

U sing the contour integral representation (13.1), the principal part of the Laurent 
expansion at s = 1 is given by 


i; 2 {s,co,a ) 


/ 1 + tw a \ 1 

\ 2(8 co ) s — 1 


(13.3) 


Proposition 13.2. Let a > 0 be a real algebraic irrational number. We put 
V = (To 1 ) (n e Z). Then for the range Re(i) > 2, we get the following 
transformation formula: 


a s Va) — t- (s,a) = (1 — a s ! ) cot(jrs/2)f ( 5 ) — y(i)^ 2 (l — s,a. 1). (13.4) 


Proof. We start from the contour integral representation of the double zeta function 
£ 2 Cl or, 1) given by 


£2 (lot, 1) = 


r(s)(e 


^ - 1 ) L 


+s-l 


Hs. 00 ) (e ? -l)(e“'-l) 


dt. 


(13.5) 


This expression is obtained if we put r = 2, w\ = 1, ^2 = 0 ;, a = 1 in (13.1). 
Since the right-hand side is continued analytically to a meromorphic function on the 
whole s -plane, we consider the range Re(.v) < — 1. We take to < 0 and let C(to) be 
the vertical line Re(f) = to, oriented upwards: 


C(?o) : t = to + iy (—00 < y < 00 ). 
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In the same way as in the proof of Theorem 9.4, shifting the contour 7(e, oo) to 
the left to C(fo) and using the residue theorem, we can write 


h(s,a, 1) 


r(s)(e 


2jtiS - I) { luo) 


+s— 1 


(e 1 - \)(e at - 1) 


dt 


-2ni Y 

77 6Z-{0} 


(2tt i n) 


5—1 


(2 Jiin/ay 1 1 

g— Inina g2 ninja J ^ Of 


and then taking the limit to — >• — oo and noting 

111 1 

= — cotjrno! H — and „ . , 

2 i 2 e 2nm/a _ j 


1 —e 


—Ini 


1 nn 
— cot — 
2 i a 


1 

2 ’ 


we have 


1) 


-2ni(2jty- 1 [ e nis < 2 (\ + e nis ) 


r(s)(e 2nis - 1 ) 




p nis/2t-\ _ p nis\ j 

+ Yi ” (1 _ “ _S) ^ 1 - J ) 


The transformation up to here is valid for Re(s) < —1. Substituting 1 — .s' for 5 and 
using r(s)r( 1 — s) = n / sin ns, in the range Re(.v) > 2 we have 


£>(1 ~s,a, 1) 


e ~ nis l 2 ( \ — e ~ nis ) 


= {2n) s r(s) sin7r.s | '\_ 2 ~ js 1 ( %(s, a) + a s *£( s, f 


1 — e 


+ 


ie -nist 2 (i + e - _ 


1 £—2 nis 


-(l-cOt(s) 


i 

yC?) 


tj(s,a) — Q!‘ ?— 1 ^ ^ 5 , ^ + (1 — a 5-1 ) cot(jrs/2)^(s) \ 

and simplifying this, we have 

a i-1 £^s, — j — £(s, a) = (1 — a* -1 ) cot(ns/2)^(s) — y(s)% 2 ( 1 — s, a, 1). 
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Therefore, noting £(a + n,s) = t-(a, .v), we obtain the assertion of the theorem. 

□ 


The transformation formula (13.4) can be extended to the following form. For 
reR, we denote by { x j the real number which satisfies x — {.\'j e Z, 0 < {x} < 1 
(fractional part of x). 


Theorem 13.3. For a real algebraic irrational number a and V 



G 


SL 2 (Z) which satisfies c > 0 andca + d > 0, the following transformation formula 
holds in the range Re(.v) > 2. 


ij s '£(.y, Va ) — f (s, a) 

= (1 - rj s ~ 1 )cot(7ts/2)^(s) - y(s) ^ &(1 - s, rj, xj + yj rj). 

j mod c 


Here we put rj = ca + d and j runs over integers which represent residue classes 
modulo c, and for each j mod c, we put xj = 1 — {jd /c}, yj = {j/c}. 


Proof. See [6] and [7]. 

If we put V = Go) in this theorem, we get the transformation formula in 
Proposition 13.2. □ 

From now on, we always assume that a is a real quadratic irrational number. 
Then there exists V = ( ab ) e 5L 2 (Z) which satisfies the conditions 


Va = a, ca + d > 0. 


(13.6) 


This is easily understood if we think as follows. Let F be the real quadratic field 
Q(a). Let L = Z a + Z be a lattice generated by a and 1 and O(L) be an order 
associated to L, that is, O(L) = {u e F \ uL C L}. We denote by E(L) the group 
of all totally positive units of O(L). We take an element ij of E(L) such that t] f \ . 
Since rj & L and rja e L, if we define a 2 x 2 matrix V with coefficients in Z by 



then we have V e SL 2 (Z) and rj = ca + d > 0. In order to make c > 0, it is 
sufficient to take rj so that rj > 1 if a > a (a is the conjugate of a), and to take // so 
that 0<)7<lifa<c7. 

Applying Theorem 13.3 to this situation, we get the following theorem. 

Theorem 13.4. When a real quadratic irrational number a is given, we take a 
totally positive unit rj of F such that 
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r = (‘ ]) «e sl i( z> 


with c > 0. Then in the range Re(.s) > 2, we have 

£(*,«) = -cot(7rs/2)f(s)+ E hV-S’V’Xj + yjrj). (13.7) 

^ j mod c 

Through this expression, £(5, a) is analytically continued to a meromorphic function 
on the whole s-plane, and the poles are s = 1 and 5=1 + ( n e Z, n / 0) 

and they are of order 1. Moreover, its values at s = 2m — 1 (m e Z, m > 2) are 
expressed using Bernoulli polynomials: 1 


£(2 m — 1, a) = 


(— l) m -'(2 7r) 2m_1 

J yj2m—2 


E E 

A:=0 7 mod c 


Bk(Xj)B2m-k(yj) 2m— k— l 
k\(2m — k)\ 


Proof. The expression (13.7) is immediately obtained by using Va = cr in 
Theorem 13.3. Poles can possibly appear only at s = 2, 5 = 1,5= 1 + ( n e 

Z, n ^ 0), and 5 = 0. For s = —2, —4, —6 we have £(5) = 0, so we note that 

these are not poles. As for 5 = 2, as we remarked in the footnote on p. 213, £(5, a) 
converges absolutely for Re(5) > 1 and is holomorphic there, so this cannot be a 
pole. Now we consider 5 = 0. Taking the result that the principal part of fois, a), a) 
at the pole 5 = 1 is given by (13.3) and the fact £(0) = — | into consideration, the 
residue of the function £(5, a) at 5 = 0 (possible pole of order at most 1) is given by 

1 2 yo / I + h Xj + yjT) \ 

71 Jt( 1 — t]~ v ) I 2 n n / 

v j mod c x ' ' ' 

We see easily that this value is zero. So £(5, a) is holomorphic at 5 = 0. 

Lastly, the values at 5 = 2m — l(m e Z. m > 2) are, by virtue of (13.7), given by 

w. , 0 ( l) m— 1 (2 tt ) 2m— 1 

b(2m — \,a) = — > t 2(2 — 2m , n, x i + y i ri) 

5 (1 — n 2m ~ 2 )(2m — 2)\ ^ S V ' J ~ xj 

j mod c 

and by Proposition 13.1, we get the expression using Bernoulli polynomials. □ 


7 When a is a unit of a real quadratic field, this kind of special value has been calculated by Berndt 
[14]. 
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13.3 %(s,a) and Continued Fractions 

As explained in detail in Zetafunktionen und quadratische Korper by D. Zagier [ 1 07, 
§13], any quadratic irrational number /) can be expanded into a continued fraction 8 
of the following form. 



(bj eZ, b 2 , b 3 ,... >2). 


It is important that all the coefficients bj (j > 2) except for the first one are integers 
bigger than or equal to 2. Here the sequence {b\, b 2 , . . .} is periodic, namely, there 
exist some natural numbers v and r such that for j not less than v, we have 


br+j = bj (j > v). 


We call the smallest such natural number r the period of j3. We write this continued 
fraction expansion as 


P = [[b u b 2 b v -i,b x b v+r - 1 ]]- 


(13.8) 


The sequence {b v , ... ,b v + r _t} is called a fundamental period. Moreover when 
v = 1 , that is, if it is periodic from the first term, we say that the continued fraction 
expansion of /I is purely periodic. 

For a real quadratic irrational number p. We denote by /3 the conjugate number 
of p. Now we assume that the continued fraction expansion of a real quadratic 
irrational number co is purely periodic and we write 


co = [[m,n 2 n r ]] (rij > 2 ). 


Then we have 


1 


= 77 ,- 1 , ]] 


[107, p. 136, (17)]. By this expression, we have 


co > 1 and 0 < co < 1 . 


(13.9) 


8 The continued fractions treated here are different from the usual continued fractions of real 
quadratic irrational numbers. 
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If a real quadratic irrational number co satisfies condition (13.9), we say that co is 
reduced. Conversely, if co satisfies this condition, we can show that the continued 
fraction expansion of co is periodic, so co is reduced if and only if the continued 
fraction expansion of co is purely periodic. 

Let F be a real quadratic held and regard F as a subheld of the real numbers R. 
Moreover we define a mapping of F into R x R by 

F(0) = (0,0) € Rx R, 

and identify 0 with the point (0, 0) of R x R. Now we assume that co € F — Q is 
reduced and that the continued fraction expansion of co is given by 

co = [[«i .«2 n r ]]. 

We dehne an integer sequence {n j }jez by extending the continued fraction 
expansion {n \,n 2 , • • • , n r } by periodicity so that « /+r = tij for any j e Z. For 
a positive integer i we dehne a reduced number co, by 

0>i = [[»/,»! + 1, ■ • • ,H/+r-l ]]■ (13.10) 

Of course we have co\ = m r +i = co. For n € Z, we put 

v (n) = e SL 2 ( Z). 

From the continued fraction expansion (13.10), we get 

(Oi = Hi — = V(ni)co i+ u 

(Oi + 1 


so we have 


co = V(n i ) F(« 2 ) • ■ ■ V(n r )co. 


We dehne now a lattice L of F by 


L — Z co Z 

and take a generator s of the group E(L) of totally positive units of O(L) so that 
s > 1 . We dehne V by the relation 


9 This is different from the usual definition of reduced for the setting of ordinary continued 
fractions. Usually a real quadratic irrational number a is called reduced when a > 1 and 
0 > a > — 1 . 
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If we think of the stabilizer subgroup r m = {M e S L 2 (Z) | Mm = m} of a>, we 
see r w is generated by V and ±/2 {1 2 is the identity matrix). Because c > 0 and r 
is the period of m, we have 


V = V(ni)V(n 2 ) — V(n r ). 


For any Vq g S L 2 (7 j ) and a real algebraic irrational number a, we use the notation 
of an automorphy factor 


J(V 0 ,a) = ca + d. (13.11) 

As in the case when a is in the complex upper half plane, this has a property 
J{V\V 2 .a) = J(V\,V 2 d)J{V 2 ,a) for any V \ , V 2 € SL 2 (Z). Then using this 
property of the automorphy factor (13. 1 1), we have 

s= J(V,m) = J{V{n\),m 2 )J{V{n 2 ),m 2 ,) ••• J(V(n T ),m) 

= ft)2&>3 ■ • • CO r CO. 


So we have 


s = twi ®2 • • ■ m r . (13.12) 

Now, we put £0 = 1, £1 = o) = co r +i, £ 2 = u>r+\m r , . . . , £ r = co r +ico r ■■■ m 2 . Of 
course we have £ r = s. Here we have 

L = Z£,- + Z£,+i (0 < i < r — 1), (13.13) 

because by the relation £,_i = £,■ x — - — we have 

J Si 1 SI a> r+2 -i 

zti-t + zSi = £,Yz— — + z 

V <O r +2-i 

= £*■ {Z{ri r -\-\—i m r +i — () + Z) 


— Z £; + Z^ + l 


and starting from / = 1 inductively we can show the assertion. 
Now we define a double zeta function of a lattice L by 
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S L {s) = Y e ~° ' (13-14) 

6eL n C(l,e) 

Here we denote by C ( 1 , e) the simplicial cone given by 

C(l,e) = {(x + ye, x + ye') \ x > 0, y > 0}. 

As we shall see later, E;(s) is written as a finite sum of double zeta functions 
explained before, so it converges absolutely for Re(.s) > 2 and is holomorphic in 
that range. Let R+ be the set of positive numbers. The group of totally positive units 
E(L) acts on R+ x R+ by 

r)(u,v) = (r]u,rjv) (rj e E(L), (n, v) e R + x R+). 

The set C(l, e) is a fundamental domain of R+ x R+ by E(L). 

Since ay > 1 for each i and since we see 1 < < f;+i < e and e < ^, + i < 

£,• < 1 , we can write 


C(l, s) = {x%k - 1 + y%k I x > 0, y > 0} (disjoint union), 

k= 1 

so by (13.13) we get 

r 

LflC(l,e) = [_J {mijir-i +«£«: | m e Z>o, n e Z>o} (disjoint union). (13.15) 
k= t 

This decomposition was given in Zagier [105]. Here we denote by Z>o or Z>o the set 
of positive integers, or integers not less than 0, respectively. By the decomposition 
(13.15), we get 


3i.(s) = Y (h-\) s h(s,a) r+2 -k, 1). (13.16) 

k= 1 

By this expression, Ei(s) converges absolutely for Re(.v) > 2 and is continued 
analytically to a meromorphic function which is holomorphic in the whole ,v -plane 
except for s = 1 , 2. 

Between the functions £(.?,&>) and E / (,<r ) , the following functional equation 
holds. 

Theorem 13.5. Let co be a reduced number of F . Put L = 7 + Zw and let s > 1 
be a totally positive fundamental unit of the order O(L) of L. Then we have 

Y(s) 

e 1 - 1 - 1 


t;(s,a>) = —cot(ns/2)t;(s) — 


El(I-s). 


(13.17) 
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We can also show through this equation that £( s , a>) can be continued analytically 
to a me romorphic function of s. 

Proof. Since we have now Wj = V (/?,■ )gj, + i and or, > 0, by Proposition 13.2, we 
have an equality 


{(Dr+2-kY '£(5, COr+l-k) “ ^r+2-k) 


= (1 - {(O r+ 2-k) S *) COt(7Tj/2)£(j) - J/(j)C2(l - S,(O r +2-k, U 

for 1 < k < r. Multiplying (fk-i )' 1 by both sides of this equality, then taking the 
sum from k = 1 to k = r and using (13.12) and (13.16), we get 

(e s_1 - l)£(s,a>) = (1 - £* -1 )cot(jr.s/2)£(.s) - y(s)S L ( 1 - s). 


□ 

Remark 13.6. If we cut out the contour integral representation, not only of a double 
zeta function, but also of a multiple zeta function in the same way as in Sect. 13.2, 
then a kind of Lerch type zeta function appears. It would be an interesting problem 
to investigate what kind of modular properties it has. 


Chapter 14 

Poly-Bernoulli Numbers 


In this chapter, we define and study a generalization of Bernoulli numbers referred to 
as poly-Bernoulli numbers, which is a different generalization than the generalized 
Bernoulli numbers introduced in Chap. 4. 


14.1 Poly-Bernoulli Numbers 


We give a definition in a down-to-earth way. 

(k) 

Definition 14.1 (Poly-Bernoulli number [56]). Poly-Bernoulli numbers B„ e 
Q (n > 0, k e Z) are defined by the generating function 


Lik( 1 ~e ') 
1 -e-< 




. r 

Here, for any integer k , Lik(t) stands for the formal power series > —r. 

' n k 

n — 1 

When k > 1, the series Lik(t) is the defining series of a “polylogarithm” 
function, whereas when k < 0, it is the Taylor series of the rational function 

It) (ct) at t ^ e or igi n - If k = 1, we have Li\(t) = — log(l — t ) and 
Li\{\ — e~ *) = t, hence by Theorem 1.12 on p. 20, the number B,*, 1 * is nothing 
but the original Bernoulli number B„ . If k > 1 , by making a change of variables 
tj m* 1 — e~‘‘ in the following iterated integral expression of L// f (/), 

1 dtk r‘ k dtk-i C‘ 3 dt2 f' 2 dt\ 
h Jo fjt-i Jo h Jo 1 — 1\ 



T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2_14, © Springer Japan 2014 
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Table 14.1 (0 < k < 5, 0 < n < 7) 


k\n 

0 

1 

2 

3 

4 

5 

6 

7 

0 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

2 

1 

6 

0 

1 

30 

0 

1 

42 

0 

2 

1 

1 

4 

1 

36 

1 

24 

7 

450 

1 

40 

38 

2205 

5 

168 

3 

1 

1 

11 

1 

1243 

49 

75613 

599 

8 

216 

288 

54000 

7200 

3704400 

35280 

4 

1 

1 

49 

41 

26291 

1921 

845233 

1048349 

16 

1296 

3456 

3240000 

144000 

1555848000 

59270400 

5 

1 

1 

179 

515 

216383 

183781 

4644828197 

153375307 

32 

7776 

41472 

194400000 

25920000 

653456160000 

49787136000 


we have the following expression of the generating function: 


Lik{ 1 ~e ') 
1 -e-* 


l 

I “ dt k 

r‘ k dt k ~ i 

rtk-\ 

f 3 dt 2 r 

e< - 1 

Jo e ‘ k ~ 1 

J 0 e lk ~' - i 

L 

Jo e't-lJo 


(14.1) 


This shows that, in the case of k > 1, the generating function of 

(k) 

poly-Bernoulli numbers B„ is obtained, starting with 1 , by applying the integration 
L dt and division by e' — 1 successively k times, and finally by multiplying e* . 


Remark 14.2. (1) Our poly-Bernoulli number is different from the “Bernoulli 
numbers of higher order” defined by Norlund [75, Chapter 6], The generating 
function of the Bernoulli numbers of higher order is (;rry) an d this can be 
written as a linear combination of derivatives of -f—r. Hence the Bernoulli 

number of higher order is essentially a linear combination of classical Bernoulli 

(k) 

numbers. As is shown below, ourB„ can also be expressed in terms of classical 
Bernoulli numbers, but we need to take products successively. 

(2) There is a very interesting object called “multiple zeta values” (cf. [57,98, 109]), 
which generalize the values at positive integer arguments of the Riemann zeta 
function. The poly-Bernoulli numbers are in some way related to the multiple 
zeta values [9] . 

Table 14.1 gives some values of for k > 0. 

The next two propositions give recurrence formulas for Bjf *, which can be 
derived from the definition and the iterated integral expression of the generating 
function. 

Proposition 14.3. For any k e Z and n > 0, we have 


We regard the sum on the right as 0 if n < 1. 
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Proof. We differentiate the relation L 4(1 — e ') = (]— e ' ) 




which 


is obtained from the definition by multiplying by 1 — e '.By the relation Li', (?) = 

1 

jLik-i(t), we have 


Lit t-i(l-e 0 

e 

\-e~< 


00 t n 00 ,n-l 

n=0 * n= 1 V 7 ‘ 


J—/1 g ^ OO ^yi 

Multiplying by on both sides and using = — , we have 

1 — e~ l n ! 

71=0 


00 t n °° t n °° fti — 1 

E »?“' ^ = E »?’ S + < e ' - » E 77 TT 


71=0 


71=0 


71=1 




E^^ + EyiE®? 1 


in - D! 


/71 — 1 


71=0 


E b ?’^t + E(E 


/ = 1 71 = 1 

OO / 71 — 1 


(« - 1)! 


1 


= 1 \m =0 


»(*) 

m+1 (77 — m)!m! 


it*) 

*0 


oo / 


+ E <' ) + E 


=i \ 


77 

777 , 


»(*) 


f" 


m+1 / 77 !' 


Hence we have 


d(*“ 1) _ rat*) ; 


"0 


0 


if 77 = 0, and if 77 > 1 we obtain 


71 — 1 / 


71 — 1 / 


■?“’ = b«> + E C, = <» + Dif + E LI i 

717=0 \ / 711=1 \ 


This divided by 7? + 1 gives the proposition. 
Proposition 14.4. For k > 1 and n > 0, we have 


V = E(-D” 


f ft 
i ft? , 


-i) 


E^( 7 K 


\l = 0 


Proof. We use the iterated integral expression of the generating function, which 
shows 


Liki 1 ~ g r ) _ e' r e - s Li k - i(l - e *) 

1 - e' - 1 io 1 - <?~ s 
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From this we have 


( °° /«-l\ pt 00 / n 

E D-') 1 

n=0 ' / ;i=0 \m=0 


— I ds 


1 n ' 
i m , 


/ 00 ,n-l \ OO / n 

= E EH) 


\ 7?=0 
OO//? 


/ ; 7 = 0 \777 = 0 


= EIE^f; ten. 


l—n 


77 =0 \/=0 
00/77 


V77 = 0 


' n \ 

\ m ) 

C 


tr-l) 


t-1) 


7 /;! 

f" +1 
(n + 1)!' 

Ds 




= £ E(-d 




=0 \777 =0 


A / + 1 1/ / U 


5 ,,-/ 


V=m 


77! 


Use (") ( m ) = (”) (" J?) and replace / — > 77 — 1 and then m — >• 77 — 777 to obtain 


Et- 1 ) 


777 =0 


\f-l + 1 / m, 




= E<-»" 


' 77 
k 777 , 




which gives the proposition. □ 

Remark 14.5. In the above two recurrences, we need the numbers with different 
(k— 1) (k) 

upper index like B„ in order to express B„ . We do not know if there exists a 
recurrence formula with the single index k . 

The following theorem generalizes Theorem 2.8 (p. 35), which expresses 
Bernoulli numbers by Stirling numbers. 

Theorem 14.6. 



<-D" E 

777 =0 


(m + 1 ) k 


(n > 0, k € Z). 


Proof. The proof goes in the same way as Theorem 2.8. In fact, by using 
Proposition 2.6 (7) (p. 30) we have 




14.2 Theorem of Clausen and von Staudt Type 


227 


= Lik{l ~ e ^> = V (1 “ O '"- 1 

^ " 72 ! 1 — e~’ 


n = 0 


m K 


_ ^ (— i)'"(e _r - i;r _ A (— i) m w! A 

(jfi -(-1)^ (m -U 1 


(m + \) k 


n 1 (-0" 


m n ! 


~ / " (-l) m w!{"]\ t n 


Comparing coefficients of ^ on both sides, we obtain the theorem. 


14.2 Theorem of Clausen and von Staudt Type 

In this section we discuss denominators of poly-Bernoulli numbers. The denomina- 
tors of the classical Bernoulli numbers are completely determined by the theorem 
of Clausen and von Staudt (Theorem 3.1, p. 41). The theorem not only determines 
the denominator but also describes the “fractional part” of B„ . In the case of general 
1 we only have partial results below, but when k = 2 (“di-Bernoulli numbers”), 
the denominators are completely determined. Note that, by Theorem 14.6, no prime 
greater than n + 1 appears in the denominator of 

Theorem 14.7. Let k >2 be an integer and p be a prime number satisfying k +2 < 
p < n + 1. 

(1) When p — 1 | n, p k M^ is in Zq,) (the set of rational numbers whose denomina- 
tors are prime to p) and 


p k B® = -1 mod p. 


(2) When p — 1 \ n, p k 1 is in Z fp ) 




1 

P 


p- 1 J 2 k 


(- 1 ) 


1-1 


n 

P~ 1 


and 
mod p 
mod p 


if 11 = 1 mod (p — 1), 
otherwise. 


Proof Denote by ord p (a) the /-;- order of a rational number a. We have 
ord /; ( p l ) = t and both numerator and denominator of a ■ ^~ OR V a ) are prime 
to p. We proceed in the same way as in the proof of Theorem 3.1, using the formula 
in Theorem 14.6 and calculating the p - order of each term in the formula. Put 

b ( „ k> (in) = * ('1^” (i" 1 ^ • We prove (1) and (2) simultaneously. 
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Write m + 1 = ap e ,(a.p ) = l,e > 0. If e = 0, then bl, k \m) e Z( P ) 
and p k ~ 1 bj, k \m) = 0 mod by the assumption k > 2, hence this term can be 
discarded. Note that, since the Stirling number } is an integer, we have 



First assume e > 2. We show the congruence p k b\ k \m) = 0 mod p and further 
p k ~ l b) k \m) = 0 mod p if p — 1 \ n. Using or& p (m\) = |^J, we estimate 



> 

in ' 

— ek = 

" ap e — 1" 

,p . 

. P . 




> p e ~ ] - 1 - ek = (1 + p - 1 Y~ l - 1 -ek 

> 1 + (e — 1 )(/? — 1) — 1 — ek = (e — 1 )(p — 1) — ek 

> (e — l)(k + 1) — ek = — k + e — 1 

> -k + 1 . 


From this we obtain ord p ( { ,n+\ f ) — ^ all( ^ P k 1 b!, k \m) e Z( P ). 

Therefore p k b) k \m) = 0 mod p holds. If at least one inequality in the above 
chain of inequalities is strict, we get p k ~ l b) k \m) = 0 mod p. If all the inequalities 
become equalities, we have e = 2, m + 1 = p 2 , and p = k + 2. In this case, the 
lemma below (the case a = p) shows p k ~ l b\ k \m) = 0 mod p if p — 1 | n. 

Lemma 14.8. Let n and a be natural numbers. We have the congruence 


n 

ap — 1 



mod p 


ifn = a — 1 + c(p — 1) for some c > a, 


0 mod p otherwise. 


We use the generating function of the Stirling numbers {" J (Proposition 2.6 (8) 
on p. 30) 


E 



(1 — f) (1 — 2 1) ■ ■ ■ (1 — mt) 


(14.2) 


When m = ap — 1, the right-hand side reduces modulo p to 
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{ ap- 


I 00 / 

= t ap ~ l Y (' 



t a-\+(a+i)(p-\) 


Here we used (1 — 1)(\ — 2t) ■ ■ ■ (1 — (p— 1)?) = 1 —t p ~ x mod p. Putting a + / = c, 
we obtain the lemma. □ 

Next suppose e = 1 (m = ap — 1). If a > 3, then p 2 \(ap — 1)! ensures 

ordp ( ( „,+i)A- ) > — k + 1 and this gives p k ~ x b^\m ) = 0 mod p. If a = 2, 

we have oxd p {m\) = 1 and ord p = 1 — k, and hence ord p (bl, k \m)) = 

1 — k + ordp ({,",})• From this, we have p k b^\m) = 0 mod p. If n ^ 1 
mod (p — 1), by putting a = 2 in the lemma above we have \ 2 p_ \ ) = 0 mod p. 

This gives us p k ~ l bj, k) (m) = 0 mod p if n ^ 1 mod (p — 1). If n = 1 

mod (p — 1), by writing n = 1 + c(p — 1), we have c > 2 (note c = 1 never 

happens because n > m) and the lemma again gives \ 2 p- \ } = c — 1 = — n mod p. 
Hence we have 


k-\ 




k - 1 


(-\) 2 P-'(2p - \)\{ 2 p_\\ 
(2 p) k 


= mod p. 


(We have used Wilson’s 1 theorem (p — 1)! = —1 mod p, which implies 
(2p — 1 )\/p = (( p — l)!) 2 = 1 mod p.) Lastly, when a = 1 (m = p — 1), 

we have bl, k \m) = — — | ;A l/ ’~ 1 ^ . By the lemma we have { /; " J = 0 mod p if n ^ 0 

mod ( p — 1) and hence p k ~ l b) k \m) = — ^{ " J mod p. If n = 0 mod (p — 1), 

then we have { ;) ",} = 1 mod p and p k b) k \m) = — 1 mod p. Combining all 
these, we obtain the theorem. □ 

Remark 14.9. Let k and p be as in the theorem, and assume n ^ 0. 1 mod ( p — 1). 
Denote by n' the unique integer satisfying n' = n mod p — 1 and 1 < n' < p. 
Then we have 


p k ~ l = ( 77 -/ 7 /)— mod p. 

n 

If in particular n is odd, we have p k ~ l M ( „ k) = 0 mod p. 
This comes from the congruence 


'John Wilson (bom on August 6, 1741 in Applethwaite, England — died on October 18, 1793 in 
Kendal, England). 
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P 


n 

P~ 


= (n — n ) — 
n 


mod p , 


which is a consequence of Eq. (14.3) in the proof of Theorem 14.10. 

The denominators of di-Bernoulli numbers are completely determined. 

(2) (n ~ 2) 

Theorem 14.10. (1) If n is odd, we have B„ = — — B„-\. ( Hence in 

this case the determination of the denominator is essentially reduced to the 
theorem of Clausen and von Staudt ( Theorem 3.1).) 

(2) Suppose n is even (> 2). For a prime p, denote by ord(p,n) the / 2 -order 
( 2 ) 

of B„ . Then we have the following: 

(2-1) If p > n + 1, then ord(p, n) > 0 (no p appears in the denominator). 

(2-2) For p with 5 < p < n + 1, 

(a) ord(p.n) = —2 if p — 1 | n. 

(b) Suppose p — 1 \ n. 

£ 

(b-1) We have ord(p.n) > 0 if either p \ — or n = n' mod p(p — 1) for some 

n 

n' with 1 < if < p — 1, and 
(b-2) ord(p.n) = —1 otherwise. 

(2-3) We have ord(3,n) > 0 if n > 2 and n = 2 mod 3, and ord(3,n) = —2 
otherwise. 

(2-4) We have ord( 2, n) > 0 if n > 2 and n = 2 mod 4, ord( 2, n) = —1 if n = 0 
mod 4, and ord( 2, 2) = —2. 

B n 

Remark 14.11. The rational number — in (b-1) is always an element of Z (/7 ) 
(Theorem 3.2 (1)). 

For the proof, we need a lemma. 

Lemma 14.12. Let n > 2 be even, p > 5 a prime, and 2p — 1 = m. We have 


(— 1 ) m rn \ 



mod p 2 , 


(— 1 ) m m\\ n ) 

and hence 'f— € Z i n) . 

(m + l ) 2 {p) 

Proof. By Proposition 2.6 (p. 30), we have 
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P - 1 


^Eh-^v, Y nH ~ l)l Vi-\Y~ 2npin ~ x +n \ modp2 - 


2p-\ 


1=1 


Using ( P l ) + ( i_ x ) = t( i-\ )’ we see ^ le ^ ast sum * s ec l ua l t0 


p - 1 


2P~ 1 


2 J P(1-«)E(“1) / ^ / _! J 1 


Combining 


<2 p- V 
i l-l , 


= (-l) 


/-l 


mod p 


and p — 1 \ n — 1 (since n is even and p is odd), we have 


p - 1 


E ( - 1 ), ( 2 f _ 1 1 ) / ' ,_1 = -E /,,_1 = ° mod p- 


p - 1 


/=i 


l=i 


Proof of Theorem 14.10. The iterated integral expression (14.1) of the generating 
function of B, ( , 2 * and ^-j- = ^“ 0 (— 1 give 


t n C 1 o' 

Vb ®- = / Y (- iy Bl - 

H " n\ e<-\ J 0 /! 


— t/y 


1=0 
— t 00 




f/ + l 


1=0 


(1 + 1 )! 


From this we obtain 


= E*- 1 *' 
1=0 



Bn-lBl 

l + 1 


If 72 is odd, one of 72 — / and / is odd. Since B/ = 0 for odd / > 3, we have for n > 3 

if = -^,,-,5! + * fi„-I . 


This also holds for n = 1, and thus (1) is proved. 
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(2-1) is clear from the formula in Theorem 14.6. By elementary computation, one 
sees that the only cases where )2 in that formula is not an integer are m + 1 = 

8, 9, prime, 2 x prime. By Lemma 14.12, one sees that p does not appear in the 
(-lfm'l"! 

denominator of — (m+1 )2 if m + 1 =2 p, ( p : prime > 5). Next consider the case 

of m + 1 = p ( p : prime > 5). In this case, we have 

p -\ / _ A p~ x 

= D- 1 ) / r z r = mod p- 

This reduces mod p to — 1 if p — 1 | n, 0 if p — 1 \ n. Hence the p - order of 
( i s — - if P — 1 | n. The other terms being in Z^, this establishes (2-2)- 

(a). Suppose p — 1 \ n. Noting the congruence 

^ i 

= (-1)' + (-1) ,_1 /?^ T mod p 2 
; = 1 1 

(expand (p — l)(p — 2) ■■■(/> — /)), we have 



(— 1 ) m m\ 


p - i / ,\ p - i p-i / , 

Ec-d'I , , " = E'"-''E'"E7 mod ^ 

/=i \ / /=i /=i i=i ( 

It is known that if n is even and p — 1 \ //, the congruence 

p-i 

l' 1 = pB n mod p 2 

i=i 


holds (cf. Ireland and Rosen [50, Cor. of Prop. 15.2.2]). On the other hand, if n = n' 
mod p — 1, 1 < n' < p — 1, then n' is also even and by Vandiver 2 [93, (63)] (see 
also Remark 14.24) we have 


E'"E 


/=i 1=1 


1 

i 


p - 1 i , 

i=\ i=i 
B n / mod p. 


2 Harry Schultz Vandiver (bom on October 21, 1882 in Philadelphia, USA, died on January 9, 1973 
in Austin, USA). 
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This gives 


(— 1 )'"/«! 



P (B„ — B n r ) mod p 2 . 


By the assumption p — 1 \ n, we have — e Zq,) and B n ' = mod /7 

(Theorem 3.2). We therefore have (m = /> — !) 


(-1 )'";«! 


1 / >\ A 2 

I = p(n —n ) — mod p , 
m\ n 


(14.3) 


and (2-2)-(b) follows. As for the 3-order, the terms in the formula of Theorem 14.6 
which have 3 in the denominators are 

2 - {” } ~5!{s} 81111 

3 2 6 2 ’ 9 2 

Computation using Proposition 2.6 (6) (p. 30) gives (2-3). The determination of the 
2-order is similar and is omitted. □ 

By noting the appearance of the condition p | ^ in (2-2)-(b-l) of Theorem 14.10, 
we can state the irregularity of a prime p in terms of the denominator of the di- 
Bernoulli number Bj, 2> . 

Corollary 14.13. A prime p (> 5) is irregular if and only if there exists an even n 

(2) 

with p + 1 < n <2/? — 4 such that the denominator of B„ is not divisible by p. 

For classical Bernoulli numbers, odd indexed ones are 0 except for B \ , denomi- 
nators of even indexed ones are easily determined (Clausen and von Staudt), and 
the numerators of even indexed ones are mysterious quantities that are closely 
related to the arithmetic of cyclotomic fields. As for the di-Bernoulli numbers, 
the odd indexed ones are essentially the classical Bernoulli numbers and the 
determination of denominators of even indexed ones requires the numerators of the 
classical Bernoulli numbers. Can one expect any interesting arithmetic concerning 
the numerators of the di-Bernoulli numbers? We do not know. 


14.3 Poly-Bernoulli Numbers with Negative Upper Indices 

When k is 0 or negative, the poly-Bernoulli number Bju* is a positive integer 
(Table 14.2). We give yet another formula for B^ in this case, and introduce 
two different combinatorial interpretations of b(, A * for non-positive k which were 
discovered recently. 

We have the following formula. 
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Table 14.2 bJ,* 0 (-5 < k < 0, 0 < n < 7) 


k\n 

0 

1 

2 

3 

4 

5 

6 

7 

0 

1 

1 

1 

1 

1 

1 

1 

1 

-1 

1 

2 

4 

8 

16 

32 

64 

128 

-2 

1 

4 

14 

46 

146 

454 

1394 

4246 

-3 

1 

8 

46 

230 

1066 

4718 

20266 

85310 

-4 

1 

16 

146 

1066 

6902 

41506 

237686 

1315666 

-5 

1 

32 

454 

4718 

41506 

329462 

2441314 

17234438 


Theorem 14.14. For any n ,k > 0, we have 


min(« ,k) 

a lr k) = E o'o 2 


7=0 


;; + 1 
7 + 1 


k + 11 
7 + ir 


In particular, B,', ^ is positive and symmetric in n and k. 

Corollary 14.15. 


E ) = b[." ) . 


(—k) 

Proof. We compute the two- variable generating function of B„ with the aid of 

Theorem 14.6 and Proposition 2.6 (7), in the following way. 


n= 0 k = 0 


££ B !+^rT = ££<-«" £<-»■ 


n= 0 k = 0 
oo 


E(-D" E ( - 1)B 

n = 0 m = 0 


m! 


m! 


(m + 1 ) A “7 77 
n\ k\ 


x n 


IV. 


= E( _1 ) m,n!e(m+1)v E(-^ 


m=0 


n 


H I X 

w I n ! 


= E(-d 

m = 0 

-’(- 


-o' 

m\ 

1 


(~ey{e~* - 1)) 


o-r+r 


e x + ey - e x+y ' 


Furthermore, we have 


3 The following proof, which greatly simplifies the original proof in [10], is due to Hiroyuki Ochiai. 
The authors would like to thank him for providing this simple proof. 
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a x + y 


a x+y 


e x + ey - e x+ y 1 - (e x - 1 )(e? - 1 ) 


, x +y 


- \y {e y - \y 


j = 0 


oo 

= y — - 

( i -L 


^-(e x - yy + '^-(e y - 1 y +1 . 

f^ 0 (J + 1) dx dy 


Applying Proposition 2.6 (7) again, we obtain 

°° °° yYl 

n n \ k\ 


n=0k=0 


e( 

7=0 \ n= J 


7=o 

oo oo oo 

EEE<+ 

j=0n=j k=j 


+ 1 1 X" + 1 


j + lj (« + 1)! I dy 


£|E 


(£=7 


A: + 11 y i+1 


j + lj(fc+l)! 


n + 1 

y + i 


A' + 1 1 jc" / 

j + lj «! A! 


Comparison of the coefficients on both sides gives the theorem (note {"+J }{*+}} = 0 


when j > min(/i, k)). 

By the theorem, we also have a different generating function for By, k * . 

Corollary 14.16. We have 

OO OO oo 

EE® ^x"y k = ypj(x) Pj (y), 

7=o 


n= 0 k = 0 


where 


Pj (x) = 


jlxd 


(l-x)(l— 2x)---(l-(y + l)x) 
Proof. This is a consequence of 

n— j d 


which follows from Proposition 2.6 (8). □ 

Computing this generating function in a different manner, we obtain the 
following. 
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Proposition 14.17. For any n > 0, we have 


D- 1 ) = °‘ 

1=0 

Proof. Denote the left-hand side of Corollary 14.16 by B(x , y ). By Theorem 14.6 
and Proposition 2.6 (8), we have 


n=0 k=0 m=0 

oo oo ^ . 

E(-ir^!E(-i)" " ki— r- 

Z Z — ' m I 1 — ( 777 

m=0 n = tn ’ 


(m + 1 ) k x n y k 


m = o 


+ l)v 


m!t” 


^ (1 + x) (1 + 2x) • • • (1 + mx) (1 - (m + l)y ) 


(The term with in = 0 is .) Here we put y = —x and use Proposition 2.6 (8), 

ml = p’E j, and Proposition 2.6 (5.1), to obtain B(x, — x) = 1 and the proposition 
follows. □ 

Remark 14.18. This is trivial when n is odd, because of the symmetry in 
Corollary 14.15. When n is even, however, this is a non-trivial statement. For 
example, from Table 14.2, we have 1 — 2+1=0, 1 — 8+14 — 8+1=0, etc. 

In recent years, two different combinatorial interpretations of the poly-Bernoulli 
numbers with negative upper indices have been found. We briefly state these 
theorems, one by Chad Brewbaker and the other by Stephan Launois. For proofs, 
see their papers [20,68]. 

Definition 14.19. A matrix whose entries are 0 or 1 is called “lonesum” if it is 
uniquely reconstructed from its row and column sums. 

Theorem 14.20 (Brewbaker [20]). Let n and k be natural numbers. The number 
ofn by k lonesum matrices is equal to B,*, k \ 

We denote by ©„ the symmetric group of degree n, viewed as a permutation 
group on the set { 1 , 2 , ■ • • , n}. 

Theorem 14.21 (Launois [68]). Let n and k be natural numbers. The cardinality 
of the set 
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We end this section by proving a special case of a theorem of Vandiver, making 
use of the formulas in Theorem 14.6 and Corollary 14.15. 

Theorem 14.22. For an odd prime p and an integer i with 1 < i < p — 2, we have 

/ i j \ 

Bi = ( 1 + - 4 1 ) (m + 1)' mod p. 

z — ' V 2 ml 

m= 1 v ' 

Proof. By Theorem 14.6 and Fermat’s little theorem, 

Bj = B ( a) = mod p. 

By Corollary 14.15, the right-hand side is equal to B^Jj- Applying Theorem 14.6 
again, we obtain 


p - 2 

B . = _ £(_i y* m 

m= 0 



(m + 1)' 


mod p. 


It therefore suffices to show the following lemma (note {^q 2 } = 0). 
Lemma 14.23. If 1 < m < p — 2, then 


(- 1 )' 


1—1 


mi 


p - 21 , 1 

1=1+-+ 
m 2 


■ -| mod p. 

m 


Write (— 1)'" = b m . The recursion of the Stirling numbers (2.1) (p. 26) 

gives 


(— 1 ) m ~hn\ 


P-1 

m 


= m(-b m - 1 + b m ) (m > 2). 


On the other hand, by Proposition 2.6 (6) (p. 30), we have 


(— l) m_1 w! 


P- 1 
m 


!-+ l 'W 


Z=1 

= 1 mod p 


’ m 

i / , 


l p - x 


mod p 


and hence b m = b m -\ + A mod p. 

This together with b\ = { / ’^ 2 } = 1 proves the lemma and thus settles the 
theorem. □ 
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Remark 14.24. For 1 < i < p — 2, the right-hand side of the theorem is congruent 
modulo p to 



Vandiver’s original theorem (a special case of [93, (63)]) states that this is congruent 
modulo p to Bi . 

Exercise 14.25. Prove that if ^ is not an integer, then m + 1 = 

8, 9, prime, or 2 x prime. 

Exercise 14.26. Define the numbers C„ k) by the generating function 


Li k ( 1 


= Yc ik)t - 

^ " nV 


(1) Prove the formula 


... " (-l) m jn !{"+*} 

n ~ ’ ^ (m + \) k ' 

777=0 

(This is regarded as a generalization of Proposition 2.10.) 
(2) Prove the duality 


C { n _ k} = Ci n > (n,k> 1). 

Exercise 14.27. Prove that the number B, ( , k) is always even when n, k > 1. Hint: 
Use Theorem 14.14. 

Exercise 14.28. List all n by k lonesum matrices for small n and k, and check 
Theorem 14.20. 

Exercise 14.29. Prove the congruence stated in Remark 14.24. 


Appendix 

Curious and Exotic Identities for Bernoulli 
Numbers 


Don Zagier 


Bernoulli numbers, which are ubiquitous in mathematics, typically appear either as 
the Taylor coefficients of xj tanx or else, very closely related to this, as special 
values of the Riemann zeta function. But they also sometimes appear in other guises 
and in other combinations. In this appendix we want to describe some of the less 
standard properties of these fascinating numbers. 

In Sect. A. 1 , which is the foundation for most of the rest, we show that, as well 
as the familiar (and convergent) exponential generating series 1 


B, 


t = E^t x " = i- 

1 ^ n\ 


x 

= l-X + 7^- 


X 


+ 


X 


n = 0 


12 720 30240 


(A.l) 


defining the Bernoulli numbers, the less familiar (and divergent) ordinary generat- 
ing series 


P(x) = ^2 B n x 11 

n = 0 


2 4 

X X X 

1 — — -j- — H - 

2 6 30 



(A. 2) 


also has many virtues and is often just as useful as, or even more useful than, its 
better-known counterpart (A.l). As a first application, in Sect. A. 2 we discuss the 
“modified Bernoulli numbers” 


B 


* 

n 


n 


E 



B r 

n + r 


(” > 1 ) - 


(A. 3) 


'Here, and throughout this appendix, we use the convention B\ = —1/2, rather than the 
convention B\ = 1/2 used in the main text of the book. 


T. Arakawa et al., Bernoulli Numbers and Zeta Functions, Springer Monographs 
in Mathematics, DOI 10.1007/978-4-431-54919-2, © Springer Japan 2014 
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These numbers, which arose in connection with the trace formula for the Hecke 
operators acting on modular forms on SL(2, Z), have several unexpected properties, 
including the surprising periodicity 

s :+ 12 = B ; (n odd) (A.4) 

and a modified form of the classical von Staudt-Clausen formula for the value 
of B„ modulo 1. The following section is devoted to an identity discovered by 
Miki [A10] (and a generalization due to Gessel [A4]) which has the striking 
property of involving Bernoulli sums both of type Y B, B„- r and Y ( " r )B, B n - r , 
i.e., sums related to both the generating functions (A.l) and (A. 2). In Sect. A.4 we 
look at products of Bernoulli numbers and Bernoulli polynomials in more detail. 
In particular, we prove the result (discovered by Nielsen) that when a product 
of two Bernoulli polynomials is expressed as a linear combination of Bernoulli 
polynomials, then the coefficients are themselves multiples of Bernoulli numbers. 
This generalizes to a formula for the product of two Bernoulli polynomials in two 
different arguments, and leads to a further proof, due to I. Artamkin, of the Miki- 
Gessel identities. Finally, in Sect. A. 5 we discuss the continued fraction expansions 
of various power series related to both (A.l) and (A.2) and, as an extra titbit, 
describe an unexpected appearance of one of these continued fraction expansions 
in connection with some recent and amazing discoveries of Yu. Matiyasevich 
concerning the non-trivial zeros of the Riemann zeta function. 

This appendix can be read independently of the main text and we will recall 
all facts and notations needed. We should also add a warning: if you don’t like 
generating functions, don’t read this appendix! 


A.l The “Other” Generating Function(s) 
for the Bernoulli Numbers 

Given a sequence of interesting numbers {a„}„> o, one often tries to understand 
them by using the properties of the corresponding generating functions. The two 
most popular choices for these generating functions are YT=o a " x " (“ordinary 
generating function”) and YT=o a n x '' / (“exponential generating function”). 
Usually, of course, at most one of these turns out to have useful properties. For the 
Bernoulli numbers the standard choice is the exponential generating function (A.l) 
because it has an expression “in closed form.” What is not so well known is that the 
ordinary generating function of the Bernoulli numbers, i.e., the power series (A.2), 
even though it is divergent for all non-zero complex values of x, also has extremely 
attractive properties and many nice applications. The key property that makes it 
useful, despite its being divergent and not being expressible as an elementary 
function, is the following functional equation: 
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Proposition A. 1. The power series (A. 2) is the unique solution in Q[[.v]] of the 
equation 


nb "(r^) -«*>=*• (A5) 

Proof. Let {B n } be unspecified numbers and define f>(x) by the first equality 
in (A. 2). Then comparing the coefficients of x m in both sides of (A.5) gives 


m — 1 

E 

77=0 

This is the same as the standard recursion for the Bernoulli numbers obtained by 
multiplying both sides of (A. 1 ) by e x — 1 and comparing the coefficients of x m / m ! 
on both sides. □ 

The functional equation (A.5) can be rewritten in a slightly prettier form by 
setting 


= 


1 if m = 1 , 
0 if m > 1 . 


(A. 6) 


fi(x) = x f(x) = Y B " x n+l , 

n= 0 


in which case it becomes simply 

f l ( Y ^)-f l (x) = x 2 . (A. 7) 

A generalization of this is given by the following proposition. 

Proposition A.2. For each integer r > 1, the power series 


Pr 0 ) = 

11=0 




satisfies the functional equation 


~P'(x) = r/ +1 


(A. 8) 


(A. 9) 


and is the unique power series having this property. 

Proof. Equation (A. 9) for any fixed value of r > 1 is equivalent to the recur- 
sion (A. 6), by the calculation 
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^ r (l_ x ) - E 

n=0 

oo 

= E 

l=r 



Alternatively, we can deduce (A. 9) from (A. 7) by induction on r by using the easily 
checked identity 


and the fact that 


x- P' r (x) = r p r +i(x) (r> 1) 


) = (— ) 

dx \l-xJ \l-x/ \l-x/ 


(A. 10) 


(A. 11) 


for any power series F(x). □ 

We observe next that the definition (A. 8) makes sense for any r in Z, 2 and that 
the properties (A. 9) and (A. 10) still hold. But this extension is not particularly 
interesting since [i-k (x) for k e Z>o is just a known polynomial in I j x : 


P-k 0) 


OO 


E 



B n X 


n—k 


k 


E(- ] ) 




x k ~ n 



+ i) = (-■)'■ ft(-i). 


where B/, ( X ) is the fcth Bernoulli polynomial. (One can also prove these identities 
by induction on k, using either (A. 10) or else (A. 9) together with the uniqueness 
statement in Proposition A.2 and the corresponding well-known functional equation 
for the Bernoulli polynomials.) However, there is a different and more interesting 
way to extend the definition of p r to non-positive integral values of r. For k e Z, 
define 


Yk(x) = 

n > max(l ,—k) 


(» - D! 

(n + k)\ 


Bn+k x 


e xQ[[.v]]. 


Then one easily checks that y- r (x) = (r — I )! fi r (x) for r > 0, so that the 
negative-index power series yu are just renormalized versions of the positive-index 
power series But now we do get interesting power series (rather than merely 
polynomials) when k > 0, e.g. 


2 Or even in C if we work formally in x r Q[[x]]. 
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Yo(x) = 

n = 1 


B n x n 
n 


Yi(x) = 

11 = 1 


B „+ 1 x” 
n(n + 1) 


Yi{x) = 

n = \ 


B„+2X n 

n(n + \)(n + 2) 

(A. 12) 


The properties of these new functions corresponding to (A. 10) and (A. 9) are 
given by: 


Proposition A.3. The power series Yk(x) satisfy the differential recursion 

x 2 Yk(x ) = Yk-\(x) -fffX (k > 0) (A. 13) 

(with y-i(x) = f\(x)) as well as the functional equations 

Yo^YZff) ~ Yo(x) = log(l-x) + x, (A. 14) 


and more generally for k > 1 


Yk 


(t^) 


Yk(x) 


(-1)* 

k\ 


log(l-x) + Plfc-i(i) 


(A. 15) 


where Pk-\(X) is a polynomial of degree k — 1, the first few values of which are 
Pol A) = 1, Pi (A) = A - P 2 ( A) = A 2 - A + i P 3 (A) = A 3 - §A 2 + 

iA + A P 4 (A) = X 4 - 2X 3 +\X 2 +\X - XX. 

Proof Equation (A.13) follows directly from the definitions, and then Eqs. (A. 14) 
and (A. 15) (by induction over k) follow successively from (A. 7) using the general 
identity (A. 1 1). □ 

We end this section with the observation that, although f(x) and the related 
power series f r (x) and y/, (x ) that we have discussed are divergent and do not give 
the Taylor or Laurent expansion of any elementary functions, they are related to 
the asymptotic expansions of very familiar, “nearly elementary” functions. Indeed, 
Stirling’s formula in its logarithmic form says that the logarithm of Euler’s Gamma 
function has the asymptotic expansion 


log P(A) ~ 


n= 2 


— n + 1 


i) 


as X -> oo, and hence that its derivative ty(X) (“digamma function”) has the 
expansion 


MX) ■= 


n a) 

P(A) 


log A — 


1 

2X 


oo R 1 

£-^*- = 108 * -»(--) 


244 


Appendix: Curious and Exotic Identities for Bernoulli Numbers 


as X — > oo, with yo(x) defined as in Eq. (A. 12), and the functions /3,(x) correspond 
similarly to the derivatives of t// (x ) (“polygamma functions”). The transformation 
x i — ^ x/(l — x) occurring in the functional equations (A. 5), (A. 9), (A. 14) and (A. 15) 
corresponds under the substitution X = — 1 /x to the translation X m- X + 1 , and the 
compatibility equation (A.l 1) simply to the fact that this translation commutes with 
the differential operator d/dX , while the functional equations themselves reflect 
the defining functional equation X + I ) = XT(X) of the Gamma function. 


A.2 An Application: Periodicity of Modified Bernoulli 
Numbers 

The “modified Bernoulli numbers” defined by (A. 3) were introduced in [A14]. 
These numbers, as already mentioned in the introduction, occurred naturally in a 
certain elementary derivation of the formula for the traces of Hecke operators acting 
on modular forms for the full modular group [A15]. They have two surprising 
properties which are parallel to the two following well-known properties of the 
ordinary Bernoulli numbers: 

n > 1 odd =± B„ = 0 , (A. 16) 

x . 1 

n > 0 even =± B„=— > — (mod 1) (A. 17) 

P 

p prime 

( p - i)i« 


(von Staudt-Clausen theorem). These properties are given by: 

Proposition A.4. Let B* (n > 0) be the numbers defined by (A. 3). Then for n odd 
we have 


| ±3/4 i fn= ± 1 (mod 12), 

| ±1/4 ifn= ±3 or ±5 (mod 12), 


and for n even we have the modified von Staudt-Clausen formula 


~nB* — B n = 


p prime 

(P+I)|n 


1 

P 


(mod 1) . 


(A. 18) 


(A. 19) 


Remark. The modulo 12 periodicity in (A. 18) is related, via the above-mentioned 
connection with modular forms on the full modular group SL(2, Z), with the well- 
known fact that the space of these modular forms of even weight k > 2 is the sum 
of k/ 12 and a number that depends only on k (mod 12). 
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Proof. The second assertion is an easy consequence of the corresponding 
property (A. 17) of the ordinary Bernoulli numbers and we omit the proof. (It is 
given in [A15].) To prove the first, we use the generating functions for Bernoulli 
numbers introduced in Sect. A. 1 . Specifically, for 1 e Qwe define a power series 
g\{t) e Q[[t]\ by the formula 

“ (,) = ^( | -Af +t = ) ~ l 0 S (1 ~ Af + <’> ' 

where yo(x) = J2n>o B n x n /n is the power series defined in (A. 12). For X = 2 this 
specializes to 


oo . oo 

ft(0 = £ -y 7737^7 - 2 W - o = 2 £ B *n tn ■ < A - 2 °) 

with B* as in (A. 3). On the other hand, the functional equation (A. 14) applied to 
x = t/(l — Xt + t 2 ), together with the parity property yo(x) + x = yo(— x), which 
is a restatement of (A. 16), implies the two functional equations 

t 

gi+iCO = g\(t) + { _ Xt + t 2 = g-xi-n 

for the power series gx. From this we deduce 

gi{t) ~ gi{-t) = (g 2 (t) - gi(0) + (gi(0 ~ go(0) + (gott) -g-i(t)) 

t t t 3t — f 3 — t 5 + t 1 + t 9 — 3t 11 

~l-t + t 2+ l + t 2+ \+t + t 2 ~ 1 - r 12 

and comparing this with (A. 20) immediately gives the desired formula (A. 18) for 
B* , n odd. □ 

We mention one further result about the modified Bernoulli numbers from [A15], 
The ordinary Bernoulli numbers satisfy the asymptotic formula 

2 n \ 

5„~(— 1) ( "~ 2) / 2 (n -5- oo, n even). (A. 21) 

(2jt)" 

As one might expect, the modified ones have asymptotics given by a very similar 
formula: 


5*~(— 1)*" 2) / 2 — (n -a- oo, n even). (A. 22) 

" V (2 n) n 

The (small) surprise is that, while the asymptotic formula (A. 21) holds to all orders 
in l/n (because the ratio of the two sides equals £(n) = 1 + 0(2~")), this is not 
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true of the new formula (A. 22), which only acquires this property if the right-hand 
side is replaced by (— l)"/ 2 7r Y n ( 4n), where Y„ (x ) is the nth Bessel function of the 
second kind. 

Here is a small table of the numbers B* and B„ = 2 n B* — B„ for n even: 


n 

2 

4 

6 

8 

10 

12 

14 

16 

18 

20 

22 

K 

1 

27 

29 

451 

65 

6571 

571 

181613 

23663513 

10188203 

564133 

24 

80 

1260 

1120 

264 

12012 

312 

38080 

1220940 

83600 

552 


0 

8 

3 

3 

10 

136 

21 

-5 

4249 

330 

651 

13 

3056 

21 

109269 

170 

247700 

57 

38775 


A.3 Miki’s Identity 

The surprising identity described in this section was found and proved by 
Miki [A 10] in an indirect and non-elementary way, using p- adic methods. In 
this section we describe two direct proofs of it, or rather, of it and of a very similar 
identity discovered by Faber and Pandharipande in connection with Chern numbers 
of moduli spaces of curves. The first, which is short but not very enlightening, 
is a variant of a proof I gave of the latter identity [A2] (but which with a slight 
modification works for Miki’s original identity as well). The second one, which 
is more natural, is a slight reworking of the proof given by Gessel [A4] based on 
properties of Stirling numbers of the second kind. In fact, Gessel gives a more 
general one-parameter family of identities, provable by the same methods, of which 
both the Miki and the Faber-Pandharipande identities are special cases. In Sect. A.4 
we will give yet a third proof of these identities, following I. Artamkin [A 1 ] . 

Proposition A.5 (Miki). Write B n = (— 1 )"B„/nforn > 0. Then for all n >2 we 
have 


n—2 


n — 2 


= E (" ) Bi B n-i+2H n B n , 


(A. 23) 


7=2 


7=2 


where H n = 1 + «+•••+ - denotes the nth harmonic number. 

n 2 n 


(Faber-Pandharipande). Write b g = (2 — 2 lg ) 
g > 0 we have 


B 2g 

(2 g)\ 


for g > 0. Then for all 


E 

gl+g2=g 

gl,g2>0 


(2g\ — 1)! (2g 2 — 1)! 
2(2g-l)\ 


bgi bg2 


£ ^2n d 

&2n 

2 n (2 n)\ 

77 = 1 


bg— /7 “I " bg . 


(A. 24) 
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First proof. We prove (A. 24), following [A2], Write the identity as a(g) = b(g) + 

oo oo 

c(g) in the obvious way, and let A(x) = a (§) x 2g ~ X , B{x) = ^ b(g)x 2g ~ x 

g = 1 g=i 

OO 

and C(x) = c(g) x 2g ~ 1 be the corresponding odd generating functions. Using 
g=i 

oo ( 

the identity b g x 2g ~ x = — , we obtain 

?= o c sinh* 


1 f x 

A(x) = - y'' b gi b g2 I t 2gl ~ x (x — t) 2g2 ~ x dt (by Euler’s beta integral) 

2 Jo 

= -[ (- — )(— )dt, 

2 Jo ' t sinh t ' \x — t sinh(x — ?)/ 

1 / sinh x \ 

— — log I ) , 

inh x V x / 


B(x) 


1 ^ 

— y 

cinh v ' 


— 2 2 "B 


2« x 2n 


C(x) = Y J b g f 

g=l J ° 


sinhx 2 n (2 «)! sinhx 

x 2g-i _ (2g-l 


x — t 


dt 


r 1 

( 1 

1 

+ y 

[_ x — t 

V sinh x 

sinh t / 


dt . 


and hence, symmetrizing the integral giving C(x) with respect to t — > x — t. 


2 A(x) — 2 C(x) 


r 1(1 — !_v_! !_) 

Jo (W sinhf/Vx — t sinh(x — ?)/ 


■t sinh(x — t)> 

-(— + ■!■)(—!— + f) 

V x — t f / V sinh x x > 


+ 


1 


1 1 
+ - 


1 


x — t sinh? t sinh(x — ?) 

1 x 


dt 


K- 


o Vsinh(?) sinh(x — t) sinhx t (x — t) 




dt 


1 


sinhx 


l°g( 


sinh t 


x — t 


t sinh(x — f) / 


t= o 


2 B(x) . 


A similar proof can be given for Miki’s original identity (A. 23), with “sinh” replaced 
by “tanh”. □ 

Second proof. Now we prove (A. 23), following the method in [A4]. Recall that the 
Stirling number of the second kind S(k.m) is defined as the number of partitions 
of a set of k elements into m non-empty subsets or, equivalently, as 1 / m ! times the 

number of surjective maps from the set { 1 , 2 ,k} to the set {1, 2 , ... ,m}. It can 

be given either by the closed formula 
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S(k, m) 


1 

ml 


£(-ir~ € 

1=0 



(A. 25) 


(this follows immediately from the second definition and the inclusion-exclusion 
principle, since l k is the number of maps from {1,2, ... ,k} to a given set of l 
elements) or else by either of the two generating functions 


Y, SQc.m) . 


k=0 




(1 — x){\ — 2x) ■ ■ ■ (1 — mx) ’ 
k (e x - l) m 


k=0 


mi 


(A. 26) 


both of which can be deduced easily from (A. 25). (Of course all of these formulas 
are standard and can be found in many books, including Chap. 2 of this one, where 
S(k. in) is denoted using Knuth’s notation {* }.) From either generating function one 
finds easily that S(k,m) vanishes for k < m, S (m , m ) = 1 , S (m + 1 , m) = n,2 + m , 
and more generally that S(m + n.m) for a fixed value of n is a polynomial in m 
(of degree 2 n, and without constant term if n > 0). Gessel’s beautiful and very 
natural idea was to compute the first few coefficients of this polynomial using each 
of the generating functions in (A.26) and to equate the two expressions obtained. It 
turned out that this gives nothing for the coefficients of m° and m 1 (which are found 
from either point of view to be 0 and B „ , respectively), but that the equality of the 
coefficients of m 1 obtained from the two generating functions coincides precisely 
with the identity that Miki had discovered! 

More precisely, from the first formula in (A.26) we obtain 


, oo n m - oo 

!og( + = J]log(— — ) = Y 

^ > 7=0 ' 7=1 J r = 1 


V + 2 r + 


CXJ 

-E( 




— m + 
r 2 


r ~'m 2 + ---)x' 


-)■ 


(the last line by the Bernoulli-Seki formula) and hence, exponentiating, 


77—2 


S(m + n.m) = B lt m+(nB „- 1 


(A. 27) 


while from the second formula in (A.26) and the expansion log((e A — 1 )/ x) = 
B n x n /n \ we get 

n> 0 
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S ( m + n , m ) 

/ m\t m\ / m\ x n / e x — \\ m 

= 0 + I") 0 + 2 ) ' ’ ' 0 + n ) * Coefficient of ^ in (— ) 

= ^1 + H„m j i^B n m + ^ ^ j H j 

= B n m+ (lH n B n + + ■■■ (n > 1). (A.28) 

Comparing the coefficients of m 2 /2 in (A. 27) and (A.28) gives Eq. (A. 23). □ 

Finally, we state the one-parameter generalization of (A. 23) and (A. 24) given 
in [A4]. For n > 0 denote by B„ (x) the polynomial B„(x)/n. 

Proposition A.6 (Gessel). For all n > 0 one has 

\(Bn-x{x) + Y / B i (x)l3 ll - i (x)j = {^B l B, l - i (x)+H n . x B n {x ) . (A.29) 

Gessel does not actually write out the proof of this identity, saying only that it can 
be obtained in the same way as his proof of (A. 23) and pointing out that, because 
1) = B„ and 2 2g B2 g (l/2) = (2 g — 1 )\b g , it implies (A. 23) and (A.24) by 
specializing to x = 1 and x = 1 / 2, respectively. 


A.4 Products and Scalar Products of Bernoulli Polynomials 

If A is any algebra over Q and c j o- e\, . . . is an additive basis of A, then each 
product c ,- e j can be written uniquely as a (finite) linear combination c fj e k f° r 
certain numbers cfj € Q and the algebra structure on A is completely determined 
by specifying the “structure constants” cfj . If we apply this to the algebra A = Q[x] 
and the standard basis e, = x ' , then the structure constants are completely trivial, 
being simply 1 if i + j = k and 0 otherwise. But the Bernoulli polynomials also 
form a basis of Q[.y], since there is one of every degree, and we can ask what the 
structure constants defined by B, ( x)Bj (x ) = cfj /?/ (x ) are. It is easy to see that 
cfj can only be non-zero if the difference r:=i + j — k is non-negative (because 
Bi (x)Bj ( x ) is a polynomial of degree i + j ) and even (because the 77th Bernoulli 
polynomial is (— l)"-symmetric with respect to v \—r 1 — x). The surprise is that, 
up to an elementary factor, cf is equal simply to the fcth Bernoulli number, except 
when k = 0. This fact, which was discovered long ago by Nielsen [All, p. 75] 
(although 1 was not aware of this reference at the time when Igor Artamkin and I 
had the discussions that led to the formulas and proofs described below), is stated in 
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a precise form in the following proposition. The formula turns out to be somewhat 
simpler if we use the renormalized Bernoulli polynomials B„(x) = 11 "f** rather 
than the B„ (x) themselves when n > 0. (For n = 0 there is nothing to be calculated 
since the product of any B , (x) with Bq(x) = 1 is just If (x).) 

Proposition A.7. Let i and j be strictly positive integers. Then 


Bj(x) B,(x) = J2 tUJ 


+ 7Wj 


Bit Bi+j-u(x) 


(A. 30) 


+ 


(-iy-i (/ _!)!(;-!)! 

U+jV- 


B. 


i+j 


Note that, despite appearances, the (constant) second term in this formula is 
symmetric in i and /', because if B, +J f 0 then i and j have the same parity. 

Proof. Write B,j(x) for the right-hand side of (A. 30). We first show that the 
difference between Bij(x) and B; (x)B j (x) is constant. This can be done in two 
different ways. First of all, using B n (x + 1) — B„(x) = x" _1 we find 


Bij(x + 1 ) - Bij(x) 


E 




‘ r 

i \2li 


+ 71 j 


j \2i n 


B 2 (X 


i+j— 21— l 




( Bi(x ) + l -x l ') + x' 1 (Bj (x) + l -x ] ^ 


= B, (x + 1 )Bj(x + 1) — Bj(x)Bj(x) . 


It follows that the B, j (x) — B-, (x)Bj (x) is periodic and hence, since it is also 
polynomial, constant. Alternatively, we can use that B' n (x) equals 1 for n = 1 and 
(« — 1 )25„_i (x) for n > 1 to show by induction on i + j that Bj j (x) and!?; ( x)Bj (x) 
have the same derivative (we omit the easy computation) and hence again that their 
difference is constant. To show that this constant vanishes, it suffices to show that the 
integrals of the two sides of (A. 30) over the interval [0,1] agree. Since the integral 
of B„ (x) over this interval vanishes for any n > 0, this reduces to the following 
statement, in which to avoid confusion with i = \/— I we have changed i and j to 
r and v. □ 

Proposition A.8. Let r and s be positive integers. Then 

[ B, (x) Bfx) dx = (-1)'— 1 Br+S . (A. 31) 

Jo (r + ^)! 


Proof. Here again we give two proofs. The first uses the Fourier development 


Bk (x) = 


k\ 

(2 rti ) k 


E : 

n£ Z 
n 7^0 


n 2 ni 


(0 < x < 1, k > 1) 


(A. 32) 
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discussed in Chap. 4, Theorem 4.11 of this book. (For k = 1 the sum converges 
only conditionally and one has to be a little careful.) Since the integral /J e 2nikx dx 
equals 8k, o, this gives 


f 


B r (x ) B s (x)dx = (— 1)' 


r] s\ 


(2 iti) 


r+s 


E; 

n€Z 
n 7^0 


i r+5 


(- 1)'- 1 


/Mi! 

B, 

(r + 5 )! 


as desired. (The second equality, giving the well-known connection between 
Bernoulli numbers and the values at positive even integers of the Riemann zeta 
function, is just the case k = r + s, x 0 of (A. 32).) The second proof, using 
generating functions, is just as short. Denote the left-hand side of (A. 31), also for r 
or 5 equal to 0, by I r s . Then we have 


^ t r ~ l i /- 1 _ e xt e x “ 
KS r\ 5 ! J 0 e t — le u —l 

r,s> 0 


dx 


1 


1 


a‘ + u — 1 


e‘ — 1 e“ — 1 t + u 


= T[2 L.1 

t + 11 [e r — 1 e~ u — 1 


00 U +k—\ ( ,.\k—l 

E Bk t — (— u) 

k\ t + u 


1 

t it 


+ E 


Bk 

k\ 


E ^ 

r,s> 1 
r+s=k 


(-«r 1 , 


and Eq. (A. 31) follows by equating the coefficients of t r 1 id 1 . 

Before continuing, we show that Proposition A. 7 immediately yields another 
proof of the identities of Miki and Gessel discussed in the preceding section. This 
method is due to I. Artamkin [Al] (whose proof, up to a few small modifications, 
we have followed here). Indeed, summing (A. 30) over all i, J > 1 with i + j = n, 
and using the easy identities 


£ 




(r > 0) 


and 


£ (i ~ l,_ ( n7 - = £ J\-*y-‘o 


n-l r 1 


i.j> 1 

i + j =n 


(n - 1)! 


dx 


i = 1 


= [ [(1 -x)"- l -(-x) n - l ]dx = 

Jo 


1 + (-1)” 
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(where the first equation is the beta integral again), we obtain 

\ = H n - 1 Bn(x) + E (" 7 Sr(0)B„-,(jc) + , 

i,j> 1 r=2 \ ' / n 

i+j=n 

(A. 33) 

which is equivalent to Gessel’s identity (A. 29). 

Proposition A. 8 describes the scalar products among the Bernoulli polynomials 
with respect to the scalar product ( f g ) = fj f(x)g(x)dx. It is more natural to 
replace the Bernoulli polynomials (x) by their periodic versions Bk(x) (defined 
for x f Z as B, t(x — [a]) or by the right-hand side of (A. 32), and for x G Z by 
continuity if k ^ 1 and as zero if A; = 1), since then the scalar product is simply the 
integral of B , {x) B s {x) over the whole domain of definition M/Z. The first proof 
just given then carries over almost unchanged to give the following more general 
result: 

Proposition A.9. Let r and s be integers > 1 and a, ft two real numbers. Then 

[ B r {x + a) B s (x + P)dx = (~l) r ~ 1 ' \ S \ B r+S (a - fi) . (A.34) 

Jo (t + .s)! 

Using this, one finds, with almost the same proof as before, the following 
generalization of Proposition A. 7: 

Proposition A.10. Let i and j be positive integers. Then for any two variables x 
and y we have 


ma x(i,;') 

B i (x)Bj{y) = Y J 

m= 0 


1 

i \ m > 


+ j —m (y ) "I" 


(~i ru 

j \m 


+ j—m 00 


, , tw _! (i — 1)! O’ — 1)! D+ , 

+ I-1V ([ . +;) | 

where B+ (x) denotes the symmetrized Bernoulli polynomial 


B m ( x - y) 


(A. 35) 


D+ ,^ _ B m (x) + (-1 ) m B m {-x) _ B m (x + 1) + B m (x) _ D t ^ | m m _ x 

V X ) — 2 — 2 — ' 2 ' X 


The same calculation as was used above to deduce (A. 33) from (A. 30), but now 
applied to (A.35) instead of (A.30), gives the following generalization of Gessel’s 
identity (A. 29): 
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E Bi ( x)Bj (y) - H n —\ (b„ (x) + B„ (y)) 
i.j> 1 

i+j=n 


n - 1 

= E 

m = 1 


n — 1 ' 

k w , 


(B n - m (y) + (~l) m B n - m (x)) 


B m C* - y) 


m 


+ 


1 + (-!)" n + 


Bf(x-y). (A. 36) 


We observe that Eq. (A. 36) was also found by Hao Pan and Zhi-Wei Sun [A12] in a 
slightly different form, the right-hand side in their formula being 


E 



B m (x - y) 

771 2 


"t" Bfj—m^x') 


B m (y — x) \ 
m 2 ) 


1_ B„(x) - BJj) 
n x — y 


(A. 37) 


which is easily checked to be equal to the right-hand side of (A. 36); their formula 
has the advantage of being more visibly symmetric in x and y and of using only the 
Bernoulli polynomials B„,(x) rather than the symmetrized Bernoulli polynomials 
B+(x), but the disadvantage of having a denominator x — y (which of course 
disappears after division into the numerator B n (x)—B„ (_y)) rather than being written 
in an explicitly polynomial form. 

We end this section by giving a beautifully symmetric version of the multiplica- 
tion law for Bernoulli polynomials given by the same authors in [A13], 

Proposition A.ll (Sun-Pan). For each integer n > 0 define a polynomial 

in four variables r, s, x and y by 


;;] v. i, (;)(;.)«, ( a. 38 ) 

i+j=n 

Thenfor any six variables r, s, t, x, y and z satisfying r+s+t = n and x+y+z = 1 
we have 


r s 

+r 

s t 


t r 

= o . 

(A. 39) 

x y_ 

n 

_y a 

n 

_z x_ 

n 



First proof (sketch). We can prove (A. 39) in the same way as (A. 36) was proved 
above, replacing the product Bj(x)Bi(y) in (A.38) for i and j positive using 
formula (A. 35) (with x and y replaced by 1 — y and x) and then using elementary 
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binomial coefficient identities to simplify the result. We do not give the full 
calculation, which is straightforward but tedious. □ 

Second proof. An alternative, and easier, approach is to notice that, since the left- 
hand side of (A. 39) is a polynomial in the variables x, y and z = 1 — x — y, it is 
enough to prove the identity for x, y. z > 0 with x + y + z = 1 . But for x and y 
between 0 and 1 we have from (A. 32) 


(2jt if 


r s 

x y. 


Y] C n (r,s;a,b)i 


,2jri(bx—ay) 


,be Z 


with 


C„(r , s',a.b) 


T.i.j>i.i+j= n ^)i(s)ja ‘b J ifa^ 0,6^0 

— ( r) n a~" if a 0, b = 0 

— ( s) n b~ n if a = 0, b 0 

0 if a = 0, b = 0 


where (x) m = x(x — 1) ■ ■ ■ (x — m + 1) is the descending Pochhammer symbol. 
Equation (A. 39) then follows from the identity 


t C„(r, 5 ; a, b)+r C„(s , t; b , c)+s C„(f, r; c,a) = 0 (a+b+c = 0, r+s+t = n ) . 


whose elementary proof (using partial fractions if abc f 0) we omit. □ 

We end by remarking on a certain formal similarity between the cyclic iden- 
tity (A. 39) and a reciprocity law for generalized Dedekind sums proved in [ A5] . 
The classical Dedekind sums, introduced by Dedekind while posthumously editing 
some unpublished calculations of Riemann’s, are defined by 

/z / b h \ 

s(b,c) = E ( b , c e N coprime), 

h (mod c ) 


where B\(x) as usual is the periodic version of the first Bernoulli polynomial (equal 
to x — | if 0 < x < 1 , to 0 if x = 0, and periodic with period 1), and satisfy the 
famous Dedekind reciprocity relation 


s(b, c) + s(c , b) 


b 2 + c 2 + 1 1 

\2bc 4 ' 


This was generalized by Rademacher, who discovered that if a, b and c are pairwise 
coprime integers then the sum 
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which equals s(a', c) for any a ' with aa'=b (mod c) or ba'=a (mod c ), satisfies 
the identity 


s(a , b\ c ) + s(b , c; a) + s(c , a; b ) 


a 2 + b 1 + c 2 _ 1 
\2abc 4 


(A. 41) 


A number of further generalizations, in which the functions B\ in (A.40) are 
replaced by periodic Bernoulli polynomials with other indices and/or the arguments 
of these polynomials are shifted by suitable rational numbers, were discovered later. 
The one given in [A5] concerns the sums 



— ^ B m ^ 

h (mod c) 



h + z 
c 



(A.42) 


where m and n are non-negative integers, a, b and c natural numbers with no 
common factor, and x, y and z elements of T:=R/Z. (The /tth summand in (A.42) 
depends on z modulo c, not just modulo 1, but the whole sum has period 1 in z.) 
For fixed m and n these sums do not satisfy any relation similar to the 3-term 
relation (A.41) for the case m = n = 1, but if we assemble all of the functions 
S„ un (in , n > 0) into a single generating function 


( a b c\ 
x y z ) 

X Y Zj 


E . 

m,n> 0 


m ! n ! 


(fr ©- 1 


(A. 43) 


in which X, Y and Z (which does not appear explicitly on the right) are formal 
variables satisfying X + Y + Z = 0, then we have the following relation: 

Proposition A. 12 ([A5]). Let a, b, c be three natural numbers with no common 
factor, x, y, z three elements of T, and X , Y, Z three formal variables satisfying 
X + Y + Z = 0. Then 

( b c a 
y z x 
Y Z X 


+ 6 



if (x,y,z) e (a,b,c) T, 
otherwise. 


6 


a b c 
x y z 
\X Y Z , 


+ 6 


We do not give the proof of this relation, since three different proofs (all similar 
in spirit to various of the proofs that have been given in this appendix) are given 
in [A5], but we wanted to at least mention this generalized Dedekind-Rademacher 
reciprocity law because of its formal resemblance, and perhaps actual relationship, 
to the Sun-Pan reciprocity law (A. 39). 
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A.5 Continued Fraction Expansions for Generating 
Functions of Bernoulli Numbers 


There are several classical formulas expressing various versions of the standard 
(exponential) generating functions of the Bernoulli numbers as continued fractions. 
A simple example is 


tanh x 




) 


1 + 


3 + 


x 

5 H 


(A. 44) 


whose proof is recalled below, and a somewhat more complicated one, whose proof 
we omit, is 


x / 2 ( _ B 2n x 2„ A 

tanh .v/ 2 V ^ ) 


1 

a\x 2 

Cl2X 2 

1 + 


(A. 45) 


with a n defined by 


ci n 


1 

~Y2 

(n + 1 )(n + 2) 

(2n — 2)(2n - l)(2n)(2« + 1) 
(n-2)(n - 1) 

(2 n - \)(2n)(2n + 1)(2/? + 2) 


if n = 1 , 


if n is even , 


if n > 1 is odd . 


It was discovered by M. Kaneko that the convergents P„ (x ) / Q„(x) of the continued 
fraction (A. 45) could be given in a simple closed form, namely 


n/2 

Pn(x) = J2 

;=0 



(in + l\ 1 x' 

{ 2 i ) (2 i + 1)! 


Qn(x) = 




77 + 1 
2/ 


-l 


( 2 /)! 
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if n is even and a similar but slightly more complicated expression if n is odd. 
(It was in connection with this discovery that he found the short recursion formula 
for Bernoulli numbers discussed in Sect. 1.2 of the book.) Again we omit the proof, 
which is given in [A6]. 

What is perhaps more surprising is that there are also nice continued fraction 
expansions for certain non-standard (ordinary) generating functions of Bernoulli 
numbers of the type considered in Sect. A. 1 , and these are in some sense of even 
more interest because the continued fractions, unlike the power series themselves, 
converge for positive real values of the argument (and give the appropriate deriva- 
tives of ij/(X) as discussed in the last paragraph of Sect. A.l). For instance, on the 
cover of the Russian original of Lando’s beautiful book on generating functions [A7] 
one finds the pair of formulas 3 


3 5 7 

X X X 

1-X + 2--+ 16.- + 272-- + , 


tan x 


1 ■ x + 2 • x 3 + 16 • x 5 + 272 • x 1 + • 


1 - 


l-2x 2 


2 ■ 3 x 2 


1 


3 ■ 4x 2 

1 


The numbers 1, 2, 16, 272, . . . defined by the first of these two formulas are just 
the numbers (4" — 2")\B„\/n, so the second formula gives a continued fraction 
expansion for the non-exponential generating function for essentially the Bernoulli 
numbers. Again we omit the proof, referring for this to the book cited, mentioning 
only the following alternative and in some ways prettier form of the formula: 


1 2 16 272 

X~X^ + X^~Y r + 


1 


X + 


1 


(A. 46) 


X 

J + T 


J + ' 


in which the continued fraction is convergent and equal to 1 — 4 ( ir ( ) ) 
for all A' > 0. 

Other continued fraction expansions for non-exponential Bernoulli number 
generating functions that can be found in the literature include the formulas 


3 In the English translation [A8] (which we highly recommend to the reader) this formula has been 
relegated to the exercises: Chapter 5, Problem 5.6, page 85. 
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£ B2,Mxf = — - 


1 + — H 

2 1 1 
- + - + 


2 3 11 x 

3 + 4 + _ 


or the equivalent but less appealing identity 


£ B n x n = 


1 + 


3 + 


2x 


2x 


5 + 


3x 


2 3x 

3 


7 + 


4x 


2 4x 


4 5x 

9+ — 


and 


£ (2 n + 1) B ln x n = 


1 + 1 + 


1 + “ + 

2 1 1 


— + — H 

2 2 11 


2 + 5 + 1 r 


— + — + — 
3 3 


all given by J. Frame [A3] in connection with a statistical problem on curve fitting. 

For good conscience’s sake we give the proofs of one continued fraction of each 
of the two above types, choosing for this purpose the two simplest ones (A. 44) 
and (A. 46). We look at (A.44) first. Define functions 7 q, I\, . . . on (0, oo) by 


I n (ci) 


f 


t n (l-t/a) n 


n\ 


dt (fl G Z>o , Cl G R>o) . 
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Integrating by parts twice, we find that 

L 


r a d 

In+i{a)= / e' — 


dt 


H 


2 p" +1 (l -t/a) n+l 
dt 2 _ ( n + 1)! 

r" _I (l -t/a) n ~ x 4/7 + 2 f"(l — t /d) n 


in - 1)! 


n\ 


dt 


4 n + 2 

= I„(a) 


) 4;/+2 I„+i(a) 

for /7 > 0. Rewriting this as = 1 and noting that 


In (a) 


In (n) 
2' 


Io(n) — £ a — 1 , /| (fl) — e a ^1 ^ + (l + — ) 

by direct calculation, we obtain 


1 


e lx + 1 1 h( 2x) 1 


1 


- + 


tanhx e 2x — 1 x Iq{2x) 


- + 


1 


x 5 1 

- + — 
x 


which is equivalent to (A.44). Similarly, for (A. 46), we define functions Jo, J\, 
on (0, oo ) by 


J n (X)= ( tanh (t/X))" e~' dt (n € Z> 0 , X € M >0 ) . 

Jo 

This time Jo(X) is simply the constant function 1, while J \ ( X ) has the exact 
evaluation 


X /X \ X / X 1 \ 

MX) =!--*(- + \)+-J (- + -), (A. 47) 

as is easily deduced from Euler’s integral representation 


f{x) = -y + 



1 - t 


1 - t 


dt , 


as well as the asymptotic expansion 


r°° / 1 2 t 3 16 t 5 

Jo Va 1 - X 3 3! + ~X~ 5 5! 
1 2 16 272 

i"F + F'r + "' 


272 t 7 \ 

1C 7! + 


e 1 dt 


MX) 
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as X — > oo. (This last expression can be written as 1 — Xyo(2/X) + Xyo(^/ X) 
with yo as in (A. 12), in accordance with (A. 47) and the relationship between yo{X) 
and \[r(X) given at the end of Sect. A.l.) On the other hand, integrating by parts and 
using tanh(x)' = 1 — tanh(.v) 2 , we find 


J„(X ) = J e~ 1 -^-((tanh (f/X))") dt 

= — J e~‘ (tanh (t/X))" 1 ^1 — ( tanh(f/X))~^ dt 
= j(j n - l {X)-J n+l (X)) 

, A , ., i . Jn-l(X) X J n + 1 (X) Ut . ^ w , 

for n > 0, and rewriting this as = 1 we obtain that J\(X) = 

Jn\X) n J n )X) 

J[(X) 

— has the continued fraction expansion given by the right-hand side of (A.46), 

MX) 

as claimed. □ 

We end this appendix by describing an appearance of the continued frac- 
tion (A.46) in connection with the fantastic discovery of Yuri Matiyasevich that 
“the zeros of the Riemann zeta function know about each other.” Denote the zeros 
of £ ( s ) on the critical line 91 (s) = \ by p„ and with 0 < 3(pi) < S(p 2 ) < ••• 
and for M > 1 consider the finite Dirichlet series Am Ov) defined as the N x N 
determinant 4 


A m (s) = 


1 1 1 11 


n ~Pl n ~P 1 ... n -PM n PM n s 


N~ Pl N~ Pl ■ ■ ■ N~ PM N~ PM N~ s 


where N = 2 M + 1 . This function clearly vanishes when ,s' = p„ or p 7 for 1 < 
n < M, but Matiyasevich’s discovery (for which we refer to [A9] and the other 
papers and talks listed on his website) was that its subsequent zeros are incredibly 
close to the following zeros of the Riemann zeta function, e.g., the first zero of 
A 50 on i + K>o following pso differs in absolute value from psi by less than 4 x 
10 -15 , the first zero of A 1500 after P 1500 differs in absolute value from P 1501 by less 
than 5x 10 -1113 , and even the 300th zero of A 1500 after P 1500 differs in absolute value 
from p 1801 by less than 5 x 10 -766 ! Moreover, if we write the Dirichlet series Am(s ) 
as cm 12„=i a M,n n ~ s with the normalizing constant Cm chosen to make Um.\ = 1, 


4 We have changed Matiyasevich’s notations slightly for convenience of exposition. 
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then it turns out that the function cjj 1 Am (s) not only has almost the same zeros, 
but is itself a very close approximation to (1 — 2 1-1 )£(.$) over a long interval of the 
critical line. 

In studying this latter function, Matiyasevich was led to consider the real 
numbers vm defined by vm = 4M where Hmm denotes the 

coefficient of n~ s in the Dirichlet series cj 4 ] Since by the nature 

of his investigation he was working to very high precision, he obtained very 
precise decimal expansions of these numbers, and in an attempt to recognize 
them, he computed the beginning of their continued fraction expansions. (Recall 
that rational numbers and real quadratic irrationalities can be recognized 
numerically by the fact that they have terminating or periodic continued 
fraction expansions.) To his surprise, when M was highly composite these 
numbers had very exceptional continued fraction expansions. For instance, for 
2 M = l.c.m.{l, 2,..., 10} = 2520, the number has a decimal expansion 
beginning 0.9998015873172093 ■ ■ ■ and a continued fraction expansion beginning 
[0, 1, 5039, 2520, 1680, 1260, 1008, 840, 720, 630, 560, 504], In view of the 
fact that nearly all real numbers (in a very precise metrical sense) have continued 
fraction expansions with almost all partial quotients very small, this is certainly 
not a coincidence, and it is even more obviously not one when we notice that 
the numbers 5040, 2520, . . . 504 are 5040/« for n = 1, 2, .... 10. This leads 
one immediately to the continued fraction (A. 46) with X = AM and hence, in 
view of the evaluation of that continued fraction given above, to the (conjectural) 
approximation v M ss \ty(M + 1) — \r]/(M + ^), which turns out indeed to be a 
very good one for M large, the two numbers differing only by one part in 10 108 in 
the above-named case 2 M = 2520. We take this somewhat unusual story as a fitting 
place to end our survey of curious and exotic identities connected with Bernoulli 
numbers. 
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